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FEATURES 

• Architecture supports scalability towards faster 
technologies 

• 15 times VAX™ 11/780 equivalent MIPS typical 
performance 

• 25 MHz (40 ns/cycle) operating frequency 

• Single-cycle execution for majority of instructions 

• Separate 32-bit address and data buses 

• 120 registers organized into seven register 
windows and eight global registers 

• Load/store architecture 

• 100 MBytes/sec. maximum memory bandwidth 

• 256 alternate address spaces, each four 
gigabytes 


• Address presentation which supports high- 
performance cache 

• Three-operand instruction format 

• Tagged operands which support Al 

• Floating point interface 

• Concurrent floating point operation with MB86911 

• 4-stage instruction pipeline 

• Delayed branch handling 

• 179-pin plastic pin grid array packaging 

• High-level language support 


The Fujitsu MB86901 has been designed for speed. It is a high-performance, 32-bit RISC (Reduced Instruc¬ 
tion Set Computer) processor which can be used in a wide variety of system applications including worksta¬ 
tions, minicomputers, and real-time systems. 

The MB86901 features a high-bandwidth, 32-bit bus interface with separate data and address buses, an op¬ 
timized four-stage instruction pipeline, and an execution unit comprised of a CPU, a barrel shifter, two data 
aligners, and a three-port register file consisting of 120 registers. A floating point interface supports concurrent 
floating point instruction execution. 


SPARC is a trademark of Sun Microsystems, Inc. 
VAX is a trademark of Digital Equipment Corp. 
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1. Introduction 

The Fujitsu MB86901 SPARC is a high-perform¬ 
ance, 32-bit, RiSC-architecture processor which ex¬ 
ecutes with 25 MiPS peak, and 15 MiPS sustained 
performance. 

The MB86901 instruction set is streamiined and 
hardwired for fast execution, with the majority of in¬ 
structions executing in a singie cycie. The processor 
features a four-stage instruction pipeiine which has 
been designed to handle data interlocks, an op¬ 
timized branch handler for efficient control transfers, 
and a bus interface that supports single-cycle 
memory accesses. 

The processor execution unit consists of a three- 
port ALU, a data aligner, and a barrel shifter. These 
units provide fast arithmetic, logical, and shift opera¬ 
tions. A separate Address Generation Unit calcu¬ 
lates load/store and branch addresses, enhancing 
processor performance by allowing the ALU to 
operate concurrently on data. 

An internal register file comprised of 120 registers 
organized into seven windows minimizes accesses 
to memory during procedure linkages and facilitates 
passing of parameters and assignment of variables. 
By reducing external memory accesses with their in¬ 
herent cycle overhead, the windowed register file 
significantly improves SPARC throughput. 

Separate data and address buses allow fast 
memory and I/O accesses without the delay and de¬ 
sign difficulties inherent in multiplexed buses. The 
32-bit data bus supports single cycle data transfers. 
The 32-bit address bus allows direct access to 4 
gigabytes of address space. Timing of the address 
presentation supports a high-performance cache. A 
separate floating point port directly interfaces to the 
MB86911 RISC Floating Point Controller which, to¬ 
gether with the Tl SN74ACT8847 floating point chip, 
allows concurrent floating point instruction execu¬ 
tion. 

These features combine to give the MB86901 supe¬ 
rior speed, power, and flexibility which can be easily 
utilized by the designer of high performance sys¬ 
tems. 

2. CPU 

The MB86901 processor is comprised of four pri¬ 
mary functional units: the Control Unit, the Execu¬ 


tion Unit, the Address Generation Unit, and the Reg¬ 
ister File. These units are supported by various reg¬ 
isters and interface logic, as shown in Figure 2.1. 

Three software-accessible registers configure and 
control operation of the processor. These are the 
Processor State Register (PSR), the Window Invalid 
Mask Register (WIM), and the Trap Base Register 
(TBR). 

A fourth software-accessible register, the Y register, 
supports 32 X 32-bit multiplication with 64-bit result. 

2.1. Instruction Pipeline 

The MB86901 uses a four-stage instruction pipeline 
to achieve an instruction execution rate approaching 
one instruction per clock cycle. A “single cycle” in¬ 
struction, i.e. one which occupies each stage of 
the pipeline for only one clock cycie, does not 
complete in one cycle. It enters the pipeline and 
completes four cycles later. During these four cy¬ 
cles, three more instructions may enter the pipeline. 
Once the pipeline is full, it is possible for one (single 
cycle) instruction to enter and one (single cycle) in¬ 
struction to exit the pipeiine every clock cycle. The 
stages of the MB86901 instruction pipeline are the 
Fetch Stage, the Decode Stage, the Execute Stage, 
and the Write Stage. Each stage utilizes one or 
more corresponding instruction registers, as shown 
in Figure 2.2. It should be pointed out that in the 
most general sense, instruction decoding is not con¬ 
fined exclusively to the Decode Stage—in each 
stage of the pipeline, the processor determines how 
the associated resources are to be utilized. 

2.1.1. Fetch Stage 

During the Fetch Stage, the processor outputs an 
instruction address, inputs the instruction via the 
data bus, and transfers it to the Instruction (Decode) 
Register to be decoded, in the event that this regis¬ 
ter cannot accept the instruction immediately be¬ 
cause of a pipeiine delay—possibly due to a 
preceding multi-cycle instruction being executed— 
the instruction is queued in the Instruction Buffer, 
which holds instructions until pipeline forwarding 
recommences. These buffer registers are used only 
as necessary to keep the pipeline full and fully util¬ 
ize the external bus bandwidth. 

2.1.2. Decode Stage 

During the Decode Stage, the processor computes 
the register file addresses of required source oper- 
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Figure 2.1 Block Diagram 



Figure 2.2 Pipeline Registers 
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ands, reads the source operands from the register 
file, and computes the next instruction address. 
Source operands are available to the processor for 
use during instruction execution. 

2.1.3. Execute Stage 

During the Execute Stage, arithmetic and logical 
operations are performed, and the results are saved 
in (the processor’s) temporary registers. Integer 
arithmetic, logical, and shift operations are executed 
in the processor’s integer ALU; floating point opera¬ 
tions are executed in a separate floating point co¬ 
processor. 

2.1.4. Write Stage 

During the Write Stage, the processor stores the 
operation results in the destination register. This up¬ 
date of the register file can occur only if no traps or 
exceptions have occurred during execution. If in¬ 
struction execution results in a trap, the write to the 
register file is aborted. 

Note that all operands are read from the processor 
register file, and all execution results are written into 
the register file. This Load/Store architecture, which 


is characteristic of RISC processors in general, dic¬ 
tates that data transfers between the processor and 
memory occur only through the use of Load and 
Store instructions. 

2.1.5. Pipeline Examples 

A peak instruction execution rate of one instruction 
per cycle is achieved by the processor when a 
sequence of single-cycle instructions—^those which 
occupy each pipeline stage for only one clock cycle 
(see Table 3.2)—passes through the pipeline. 

Figure 2.3 shows single-cycle instructions, none of 
which require memory access, moving through the 
instruction pipeline. Each instruction passes through 
each of the four pipeline stages in one cycle. Note 
that the instructions are fetched during consecutive 
clock cycles. 

Multiple-cycle instructions and instructions which 
access memory require more than four cycles to 
pass through the pipeline. Figure 2.4 shows a pipe¬ 
line example in which the first instruction, a 2-cycle 
Load instruction fetched during clock cycle #1, re¬ 
quires one extra cycle to complete. Extra cycles are 


Figure 2.3 Single-cycle Instruction Pipeline Progression 
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required when the processor needs to use the bus 
for something other than fetching the next instruc¬ 
tion. These extra cycles of multi-cycle instructions 
are actually generated by internal opcodes (lOPs) 
which are inserted into the pipeline as needed (see 
Figure 2.4.1 in which a store instruction requires 2 
extra cycles). A two-stage Instruction Buffer is used 
to hold instructions prior to decoding, so that the 
processor may continue to fetch additional instruc¬ 
tions while internal opcodes are injected into the 
pipeline (see Figure 2.2). Only two stages are re¬ 
quired because a maximum of two extra cycles will 
elapse between the issue of a multicycle instruction 
and the presentation of an address for bus access. 

With pipelined execution, it is possible that an in¬ 
struction may try to use the contents of a register 
that is in the process of being updated by a previous 
instruction. Special bypass paths in the MB86901 
make the correct data available to following instruc¬ 
tions for all internal register to register operations 


(see Figure 2.1), but do not solve the problem for 
loads to the registers from external memory. Inter¬ 
lock hardware prevents an instruction following a 
Load instruction from reading the register being 
loaded until the load is complete. This is accom¬ 
plished by the processor generating a one-cycle 
delay as shown in Figure 2.4.2. 

While this operation is transparent to software, com¬ 
pilers and assembly language programmers should 
avoid Loads followed immediately by instructions 
using the register contents. 

2.1.6. Delayed Control Transfers 

Instructions are fetched and executed from sequen¬ 
tial memory locations until a control transfer occurs. 
A control transfer results from execution of a 
Branch, Jump, Call, or Return From Trap instruction 
or from a trap. 


Figure 2.4 Two-cycle Instruction Pipeline Progression 
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Figure 2.4.1 Three-cycle Instruction Pipeline Progression 
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Figure 2.4.2 Illustration of Register Interlock 
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Traditional processors execute the first instruction of 
a branch target immediately after executing the in¬ 
struction which causes the transfer. However, this 
method of control transfer is not very efficient in 
pipelined RISC architectures, since instructions 
which are in the pipeline when a control transfer oc¬ 
curs must be “flushed”. Thus, instructions which 
have been fetched after a control transfer instruction 
would normally be discarded, and the pipeline “re¬ 
filled” with instructions from the branch target—re¬ 
sulting in wasted cycles. 

The MB86901, however, optimizes instruction pipe¬ 
line utilization by providing delayed control transfer 
for the Branch, Call, Jump and Link, and Return 
From Trap instructions. The processor always 
fetches the instruction, called a delay instruction, 
which foliows a delayed control transfer instruction. 
This delayed branch instruction feature of the 
MB86901 allows branches (whether taken or not) to 
occur without causing any flushes of the pipeline as 
shown in Figure 2.4.3a. If the compiler or program¬ 
mer cannot place an appropriate instruction—i.e. 
one which is suitable whether the branch is taken or 
not—in the delay slot, the instruction fetched may 
be annulled as shown in Figure 2.4.3b. (See discus¬ 
sion of Annul Bit under Section 3.3.1.) Figure 2.8 in 
Section 2.5.3 shows an example of delay instruction 
execution. 

Whenever the processor pipeline is frozen as the re- 
sult of a n e xternally generated hold input such as 
MHOLD or BHOLD, the pipeline stalls in the ex¬ 
ecute phase of the instruction which caused the 
hold as shown in Figure 2.4.4. 

Pending traps, either synchronous or asynchronous, 
are prioritized and taken during the write phase of 
each instruction as shown in Figure 2.4.5. Instruc¬ 
tions in the pipeline are annulled following the in¬ 
struction which trapped. 

2.2. Execution Unit 

The MB86901 Execution Unit includes a fast, 32-bit 
carry-look-ahead integer Arithmetic Logic Unit 
(ALU), a 32-bit barrel shifter, and a Data Aligner as 
shown in Figure 2.1. 

The ALU executes all processor arithmetic and logic 
operations in one cycle, and sets the processor In¬ 
teger Condition Code flags (icc<23;20> in the Pro¬ 


cessor State Register, see Section 2.5.2.1) accord¬ 
ing to operation results. It has two input ports, and 
one output port. 

The ALU receives inputs from the A and B Operand 
registers. By means of multiplexers, these registers 
may receive source operands from the register file 
or pipeline bypasses or instruction immediate oper¬ 
ands from instruction pipeline registers. The ALU re¬ 
sult is captured in the Result register, from which 
data is transferred to the register file via the Data 
Aligner. 

Two data bypasses are used to handle data de¬ 
pendencies between consecutive instructions (see 
Figure 2.1). 

One bypass transfers the output of the ALU directly 
to the Operand registers, bypassing the Result reg¬ 
ister. This bypass is used when a source operand of 
an instruction depends on the result of the previous 
instruction. 

The second bypass transfers the output of the Re¬ 
sult register directly to the Operand registers. This 
bypass is used when an instruction source operand 
depends on the result of the instruction two cycles 
previous. 

The Data Aligner is used during Load instruction ex¬ 
ecution to align bytes and halfwords transferred to 
the register file. 

Closely related to the ALU is the barrel shifter which 
shifts, in one cycle, 32-bit data by the count 
specified by instructions, and is used during Store 
instruction execution to align bytes and halfwords 
transferred to the system data bus. 

2.3. Address Generation Unit 

The Address Generation Unit contains logic which 
calculates addresses according to instruction format 
and address mode selection, relieving the integer 
ALU of the task. The unit calculates addresses util¬ 
izing instruction displacement data for Format 1 in¬ 
structions, immediate data for Format 2a in¬ 
structions, displacement data for Format 2b instruc¬ 
tions, register file data or register file and immediate 
data, respectively, for Format 3a and Format 3b in¬ 
structions, and Trap Base Register data for trap 
address calculation (see Section 3 for a detailed dis¬ 
cussion of Instruction Formats). 
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Figure 2.4.3a Delay Instruction Executed During Branch Taken (a=0) 
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Figure 2.4.3b Delay Instruction Annulled (a=l) During Branch Not Taken 
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Figure 2.4.4 Instruction Fetch (Cache Miss) 
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Figure 2.4.5 Trap Handling in Pipeline 
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Included in the Address Generation Unit are four 
program counter registers. Two of the registers, The 
Program Counter and Next Program Counter regis¬ 
ters, are saved during trap servicing to allow re¬ 
sumption of program execution after traps are 
serviced. 

2.4. Address Space Organization 

The MB86901 processor is capable of supporting up 
to 256 address spaces of 4 gigabytes each. The ex¬ 
ternal system distinguishes these spaces by means 
of an address space identifier (asi) generated by the 


processor. These spaces can include system control 
registers, main memory, etc. The SPARC architec¬ 
ture defines four address spaces and their asi 
values as follows: 


Address Space 

Identifier (asi) Address Space 


00001000 

00001001 

00001010 

00001011 


User Instruction 
Supervisor Instruction 
User Data 
Supervisor Data 


Figure 2.5 Data Types 
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These four asi values indicate to the external sys¬ 
tem whether the processor is in user or supervisor 
mode, as indicated by the Processor State Register 
(see Section 2.5.2.1), and whether the access is an 
instruction or data reference. Load/store instructions 
normally generate an asi of either 00001010 (user 
mode) or 00001011 (supervisor mode) for data 
access. However, load from alternate space and 
store into alternate space instructions, which are 
privileged instructions executable only in supervisor 
mode, use the asi value supplied by the instruction 
itself (see Section 3.1). This asi value may be one of 
the remaining 252 which are implementation defi¬ 
nable. 

Address spaces are selected by the ASI<7:0> bus 
interface signals as follows: 


ASi<7:0> 

Address Space 

0-7 

Implementation Definable 

8 

User Instruction Space 

9 

Supervisor Instruction Space 

10 

User Data Space 

11 

Supervisor Data Space 

12-255 

Implementation Definable 


2.4.1. Processor Data Types 

The MB86901 processor supports seven integer 
and three ANSI/IEEE 754-1985 standard floating 
point data types, as shown in Figure 2.5. The in¬ 
teger data types include byte, unsigned byte, half¬ 
word, unsigned halfword, word, unsigned word, and 
double word. Floating point types include single, 
double, and extended precision. 

The floating point double type includes two sub¬ 
fields: 1) the double-e, which contains the sign, ex¬ 
ponent, and high-order fraction, and 2) the 
double-f, which includes the low order fraction. The 
floating point extended type includes four subfields: 
1) the extended-e, which contains the sign and ex¬ 
ponent, 2) the extended-f, which contains the in¬ 
teger part of the mantissa, and the high order part of 
the fraction, 3) the extended-f-Iow, which contains 
the low-order fraction, and 4) the extended-u, which 
is unused. 

Tagged data for arithmetic operations on dynami¬ 
cally allocated variables is also supported. This type 
of data is useful in Lisp and Smalltalk programming 
environments. 


2.4.2. Addressing Conventions 

Load and store instructions follow the big endian ad¬ 
dressing convention: 

Bytes 

For load and store byte instructions, increasing 
the address means decreasing the significance of 
the byte within the word. The most significant 
byte (MSB) of a word is accessed when address 
bits <1:0> = 0, and the least significant byte is 
accessed when <1:0> = 3. 

Halfwords 

For load and store halfword instructions, the most 
significant halfword is accessed when address bit 
<1> = 0, and the least significant halfword is 
accessed when <1 > = 1. 

Doublewords 

For load and store doubleword instructions, the 
most significant word is accessed when address 
bit <2> = 0, and the least significant word is 
accessed when <2> = 1. 

In general, the address of a doubleword, word, or 
halfword is the address of its most significant byte. A 
doubleword datum is located at a doubleword 
address, which must be evenly divisible by eight. A 
word datum is located at a word address, which 
must be evenly divisible by four. A halfword datum is 
located at a halfword address, which must be evenly 
divisible by two. If a doubleword, word, or halfword 
load or store instruction generates an improperly 
aligned address, a “memory address not aligned” 
trap occurs. 

Figure 2.5.1 Addressing Conventions 
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2.5. Registers 

The MB86901 (lU) features two types of registers: 
working registers (called r registers), which are 
used for normal operations, and control/status regis¬ 
ters, which keep track of and controi the state of the 
ID. All r registers are 32 bits wide. They are divided 
into eight giobal registers and a number of overlap¬ 
ping blocks called windows. Each window contains 
24 registers. Because of the overlap, 16 registers 
are counted for each window. Therefore, with 7 win¬ 
dows and eight global registers, the MB86901 has a 
totai of 120 working registers. 


The Ill’s control/status registers are ali 32-bit 
read/write registers unless specified otherwise. 
They include the program counters (PC and nPC), 
the Processor State Register (PSR), the Window In¬ 
valid Mask Register (WIM), the Trap Base Register 
(TBR), and the multiply-step (Y) register. These reg¬ 
isters are discussed in detail in Section 2.5.2, and 
are shown in Figure 2.6. 

2.5.1. Register File 

The MB86901 register file contains 120, 32-bit reg¬ 
isters. 


Figure 2.6 Programming Model 
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Eight of the registers, r0-r7, are global general-pur¬ 
pose registers which are accessible at all times. 112 
of the registers are grouped into seven windows 
which overlap to form a circular stack, with only one 
window, the active window, accessible at any time. 

Figures 2.7 and 2.7.1 show a representation of the 
register windows, which are numbered contiguously 
from 0 to 6. In general, the number of windows 
(NWINDOWS) could range from 2 to 32, depending 
on the implementation. Implemented windows must 
be numbered contiguously from 0 to NWINDOWS - 
1. The windows are addressed by the CWP (Current 
Window Pointer), a field of the Processor State Reg¬ 


ister. The active window is defined as the one cur¬ 
rently pointed to by the CWP. At most 6 (NWiN- 
DOWS - 1, in general) are available to user code 
since one window must be availabie for trap han¬ 
dlers. 

At any given time, there are 32 working registers 
available to a given process. Registers r0-r7 are 
global registers which are equaliy avaiiable from all 
windows. The active window provides an additional 
24 registers designated r8-r3l. These registers are 
segmented into OUT registers (r8-r15), LOCAL reg¬ 
isters (r16-r23), and IN registers (r24-r31). 


Figure 2.7 Register Fiie with Windows 
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The LOCAL registers are unique to each window. 
The IN and OUT registers, however, are shared be¬ 
tween adjacent windows. The OUTS from a pre¬ 
vious window (CWP + 1) are the INS of the active 
(current) window, and the OUTS from the active win¬ 
dow are the INS of the next window (CWP -1). The 
windows are joined together in a circular stack, 
where the highest numbered window is adjacent to 
the lowest: the INS of window 6 are the OUTS of 
window 0. 

The number of registers in the IN, LOCAL, and OUT 
window segments may be varied by software con¬ 
vention, but the total number of registers in the win¬ 


dow may not exceed 24. The number of IN or OUT 
registers must be no greater than eight, and the 
number of IN and OUT registers must be equal. For 
instance, r8-r12 can be designated OUT registers, 
r13-r26 can be designated LOCAL registers, and 
r27-r31 can be designated IN registers for a total of 
5 OUT registers, 14 LOCAL registers, and 5 IN reg¬ 
isters. 

2.5.1.1. Window Selection 

The CWP field (bits <4:0>) of the Processor State 
Register (see Section 2.5.2.1) acts as a pointer to 
the active window. This field is written with the 
WRPSR (Write Processor State Register) instmc- 


Figure 2.7.1 Register Windows 
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tion, is decremented by the SAVE instruction, and is 
incremented by the RESTORE and RETT (Return 
From Trap) instructions. The CWP cannot point to 
an unimplemented window; therefore arithmetic 
done on the CWP is done modulo the number of im¬ 
plemented windows (7 for the MB86901, NWIN- 
DOWS in general). The WRPSR instruction does 
not write the PSR and causes an “illegal instruction” 
trap if the result would cause the CWP field to point 
to an unimplemented window. 

The Window Invalid Mask (WIM) Register is a regis¬ 
ter which, under software control, detects the occur¬ 
rence of lU register file overflows and underflows. 
This register is further described in Section 2.5.2.2. 

2.5.1.2. Register Window Use 

In a conventional, non-windowed architecture, there 
is significant overhead associated with procedure 
call linkage. This overhead is the time and bus traf¬ 
fic required to save register contents and transfer 
parameters on entry to a procedure and to restore 
previous register contents and transfer results on re¬ 
turn from the called procedure. 

In the windowed architecture of the MB86901, the 
general registers visible to the user are a “window” 
into the larger register file. A procedure linkage 
mechanism can effectively “save” local variables in 
registers by simply moving the “window” to a fresh 
set of registers for the called procedure. This is ac¬ 
complished by decrementing the CWP. Similarly, the 
previous register contents can be restored by 
changing the window pointer to indicate the original 
window. Since only a pointer is changed to bring 
about save and restore operations, procedure link¬ 
age overhead is greatly reduced. 

Because the processor logically provides new locals 
and outs after every procedure call, register local 
values need not be saved and restored across calls. 
For instance, the local variables of a procedure in 
window “n” remain unaltered during trap servicing 
because the trap service routine uses window “n-1”, 
with its own set of local registers. Once trap servic¬ 
ing is complete and the Return From Trap instruc¬ 
tion is executed, the procedure continues with 
window “n” active again and containing the unal¬ 
tered local values. 

The overlapping registers also minimize the over¬ 
head of passing and returning parameters. In pre¬ 
paration for a call, a procedure can move para¬ 


meters required by the called routine into the cal¬ 
ler’s OUT registers. After the CALL, the CWP is 
decremented with the SAVE instruction, and what 
was the “next” window becomes the active window 
with the parameters directly accessible to the called 
routine in its IN registers, since the OUT registers of 
the caller’s register window are the IN registers of 
the called routine’s register window. 

Likewise, in preparing for a procedure return, the 
called routine can then move its results into its IN 
registers. After the CWP is incremented via the RE¬ 
STORE instruction what was the “previous” window 
becomes the active window once again with the re¬ 
sults available to the calling procedure, since the IN 
registers of the called routine’s register window be¬ 
come the OUT registers of the calling procedure’s 
register window. Note that the terms IN and OUT 
are defined relative to calling, not returning. 

Window Overflow/Underflow 

Since only a finite number of register windows are 
available, the register file can fill if the number of 
procedure calls exceeds the number of procedure 
returns by 6 (NWINDOWS - 1, in general). A sub¬ 
sequent call results in a Window Overflow trap (see 
Table 5.1). The overflow trap service routine must 
move one or more windows from the register file to 
memory (IN and LOCAL registers only) to release 
register window(s) for use. 

Similarly, the register file can become empty if the 
number of procedure returns exceeds the number of 
calls by 6 (NWINDOWS - 1, in general). A sub¬ 
sequent return results in a Window Underflow trap. 
The underflow trap service routine can move pre¬ 
viously saved windows from memory into the regis¬ 
ter file. 

The SAVE instruction automatically checks for win¬ 
dow overflow, and the RESTORE instruction auto¬ 
matically checks for window underflow according to 
the contents of the Window Invalid Mask Register. 
The Window Invalid Mask Register and its as¬ 
sociated traps are further discussed in Section 
2.5.2.2. 

The representation of the register windows in Figure 
2.7.1 illustrates the seven windows. Assume W6 is 
the oldest window, and WO is the newest and corre¬ 
sponds to a procedure that attempted to execute a 
SAVE instruction and generated a window-overflow 


15 




MB86901 


lllllllllllllllllllllllllllllllllllilllllllllllll 


FUJITSU 


trap. The trap handler cannot use W6’s ins or Wl’s 
outs, but it is always guaranteed WO’s locals. 

Dedicated Register File Registers 

Several registers have special uses which must be 
considered when using the register file. 

If global register rO is addressed as a source oper¬ 
and (rs1 or rs2 = 0), the operand value of 0 is re¬ 
turned. If rO is addressed as a destination operand 
(rd = 0), no register is modified (see Instmction For¬ 
mats, Section 3.1, for operand addressing descrip¬ 
tions). The rO register is therefore a convenient 
source of 0 value for software, and a convenient 
destination register for instructions which are ex¬ 
ecuted for condition code results only. 

When a CALL instruction is executed, the current 
value of PC, which points to the CALL itself, is 
saved in Register 15 (r15) of the active window prior 
to the transfer of control to the CALL target. The 
called routine returns by a Jump and Link (JMPL) in¬ 
struction (see Section 3.3.3) to the value in r15, plus 
an offset to account for the CALL and its delay in¬ 
struction. In other words, the typical return address 
is the value in r15 plus 8. Note that the called 
routine must not use r15 without first saving the re¬ 
turn address contained within. 

Each trap saves the program counters (PC and 
nPC) into registers r17 and r18, respectively, of the 
“next” window for use in returning from the traps. 
These registers must therefore not be used by the 
trap service routines unless they are first saved in 
memory. 

2.5.2. Special Purpose Registers 

The MB86901 has four 32-bit, read/write registers 
which are used for special purposes. The Processor 
State Register (PSR), the Window invalid Mask reg¬ 
ister (WIM), and the TRAP Base Register (TBR), are 
control/status registers. The fourth register, the Y 
register, is used during multiplication to create 64-bit 
products. 


2.S.2.I. Processor State Register (PSR) 

The Processor State Register is the primary proces¬ 
sor controi and status register, it contains 11 mode 
and status fields which configure the processor and 
report processor status and execution results. The 
mode fields, shown in upper case in the following 
PSR register diagram, are set by the operating sys¬ 
tem to configure the processor. The status fields, 
shown in lower case, are set by the processor to in¬ 
dicate effects of instruction execution. 

The PSR, with the exception of several fields de¬ 
scribed beiow, is written and read directly with the 
privileged WRPSR (Write PSR) and RDPSR (Read 
PSR) instructions. It is modified by the SAVE, RE¬ 
STORE, Ticc,and RETT instructions, as weli as in¬ 
structions which affect the Integer Condition Code 
(icc) flags—bits <23:20>. 

The PSR fields and their respective bits are: 

impl<31:28>: Implementation 

This field identifies the impiementation number of 

the processor; it is hardwired to 0 in the MB86901 

processor. 

ver<27:24>: Version Number 
This fieid identifies the processor version, and is in¬ 
tended for factory use. This field is hardwired to 0 in 
the MB86901 processor and can be read, but can¬ 
not be written. 

icc<23:20>: Integer Condition Codes 

This field contains the negative (n), zero (z), over- 
fiow (v), and carry (c) integer condition code flags. 
These bits are modified by the WRPSR instruction, 
and by arithmetic and logical instructions whose 
names end with the letters cc (for example, ANDcc). 
The Bicc and Ticc instructions base their controi 
transfers on these bits, which are defined as fol¬ 
lows: 


31 28 
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n<23>—Set equal to 1 if the ALU result was 
negative for the last instruction that modified the 
icc field; equal to 0 othenwise. 

z<22> —Set equal to 1 if the ALU result was zero 
for the last instruction that modified the icc field; 
equal to 0 otherwise. 

v<21>—If this bit equals 1, an arithmetic over¬ 
flow occurred during the last instruction that 
modified the icc field; equal to 0 otherwise. Logi¬ 
cal instructions that modify the icc field always 
set the overflow bit equal to 0. 

c<20> —If this bit equals 1, either an arithmetic 
carry out of bit 31 occurred as the result of the 
last addition that modified the icc, or a borrow 
into bit 31 occurred as the result of the last sub¬ 
traction that modified the icc; equal to 0 
otherwise. Logical operations that modify the icc 
field always set the carry bit to 0. 

reserved<19:13>: Reserved Field 

This field is reserved and should only be written to 0 

with the WRPSR instruction. 

EC: Enable Coprocessor 

This bit determines whether the coprocessor is 
enabled or disabled. 1 = enabled, 0 = disabled. 

EF: Enable Floating Point Unit 

This bit determines whether the floating point unit is 

enabled, or disabled. 1 = enabled, 0 = disabled. 

If the FPU is either disabled, or enabled and not 
present, an FPop, FPfcc, or floating point load/store 
instruction causes an fp disabled trap. Similarly, if 
the coprocessor (CP) is either disabled, or enabled 
and not present, a CPop, CPccc, or coprocessor 
load/store instruction causes a cp disabled trap. 

When the FPU (or CP) is disabled, it retains its state 
until it is reenabled or reset. When disabled, the 
FPU can continue to execute instructions in its 
queue. The CP can also, if it has a queue. 

When the FPU is present, software can use the EF 
bit to determine whether a particular process uses 
the FPU. If a process does not use the FPU, the 
FPU’s registers need not be saved and restored 
across context switches. Also, if the FPU is not pre¬ 
sent, (as indicated by the bp_FPUj3resent signal), 
the fp disabled trap can be used to emulate the 


floating point instruction set. (This also applies to 
the coprocessor.) 

PIL<11:8>: Processor Interrupt Level 

This field identifies the processor interrupt level, 
with the processor accepting only interrupts with 
level 15 (non-maskable interrupts), or with levels 
higher than the value in the PIL field (maskable in¬ 
terrupts). Bit 11 is the most significant bit, and bit 8 
is the least significant bit. 

S<7>: Supervisor Mode 

This mode bit determines whether the processor is 
in supervisor or user mode: when S = 1, the proces¬ 
sor is in supervisor mode. 

Note that because the instructions to write the PSR 
are available only in supervisor mode, the supervi¬ 
sor mode can be entered from only by hardware 
reset, or by a software or hardware trap. 

PS<6>: Prior S State 

The PS bit records the state of the S bit when each 
trap is entered so that the processor can return from 
the trap in the proper supervisor or user mode. 

Processor hardware changes the PS bit to the state 
of the S bit when entering a trap, then changes the 
S bit to the state of the PS bit when returning from 
the trap. 

ET<5>: Enable Traps 

Traps are enabled when ET = 1. When ET = 0, traps 
are disabled and all asynchronous traps (interrupts) 
are ignored. Synchronous traps and floating 
point/coprocessor traps cause t he lU to halt and 
enter error mode (see ERROR description. Table 
7.1). 

CWP<4:0>: Current Window Pointer 
The CWP field points to the register window which 
is currently active. CWP is written and read with the 
WRPSR and RDPSR instructions, is decremented 
by traps and the SAVE instruction, and is incre¬ 
mented by the RESTORE and RETT instructions. 
The CWP cannot point to an unimplemented win¬ 
dow. Attempting to write a value to the CWP field 
which points to an unimplemented window results in 
an “illegal instruction” error. Therefore, arithmetic 
done on the CWP is done modulo 7 to ensure that it 
always points to one of the seven implemented win¬ 
dows. 
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2.5.2.2. Window Invalid Mask Register (WIM) 

The WIM register contains seven register window 
mask bits, each of which corresponds to an imple¬ 
mented register window. If a SAVE, RESTORE, or 
RETT would cause the CWP to point to a window 
whose corresponding WIM bit equals 1, it causes a 
Window Overflow (SAVE) or Window Underflow 
(RESTORE, RETT) trap. 

The WIM register can be written with the WRWIM 
instruction, and the register can be read with the 
RDWIM instruction. Bits corresponding to unimple¬ 
mented windows are read as zeroes, and values 
written to unimplemented bits are ignored. 

The WIM provides the following fields: 

reserved<31:7>: Reserved Field 

This field is reserved for potential future expansion 

to additional windows. 

W6-W0<6:0>: Window Masks 
Each of these bits is a register window mask bit, 
with WO the mask bit for Window 0, W1 the mask bit 
for Window 1, etc. 

2.5.2.3. Trap Base Register (TBR) 

The Trap Base Register contains three fields which 
generate the address of the trap handler when a 
trap occurs. These are: 

TBA<31:12>: Trap Base Address 
The TBA field contains the base address of the Trap 
Dispatch Table, which is controlled by software. The 
most significant 20 bits of the trap tabie base 
address are written into the TBA field with the 
WRTBR instruction. Note that the reset trap is an 
exception; it traps to address 0. 


tt<ll:4>: Trap Type 

Each trap is identified by a unique 8-bit trap type 
number. The processor writes the appropriate trap 
type number into the tt field of the TBR when it rec¬ 
ognizes a trap, then uses the number as an offset 
into the Trap Dispatch Table. The tt field remains un¬ 
changed until the next trap occurs. The WRTBR in¬ 
struction does not affect the tt field. 

nuil<3:0>: Null Field 

This field is hardwired to 0 to force word access in 
the Trap Dispatch Table. The WRTBR instruction 
does not affect this field. 

The entire TBR register can be read with the 
RDTBR instruction. Use of the TBR is further de¬ 
scribed in Section 5.4, Trap Addressing. 

2.5.2.4. Y Register 

The multiply step instruction (MULScc) uses the 32- 
bit Y register to generate 64-bit products. Use of the 
Y register is further described in Section 3.2.3. 

The Y register can be written with the WRY instruc¬ 
tion, and read with the RDY instruction. 

2.5.3. Program Counters 

Two registers, the Program Counter (PC) and the 
Next Program Counter (nPC), are used by the pro¬ 
cessor for instruction addressing. 


Figure 2.8 Program Counter Sequencing 
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The PC register contains the word address of the in¬ 
struction currently being executed by the lU, and the 
nPC register contains the word address of the next 
instruction to be executed, assuming a trap does not 
occur. In delayed control transfers (see Section 3.3), 
the instruction that immediately follows a control 
transfer (the delay instruction) may be executed 
before control is transferred to the target. The nPC 
is necessary to implement this feature. Most instruc¬ 
tions complete by copying the contents of the nPC 
into the PC, then either increment nPC by 4, or, if 
the instruction implies a control transfer, write the 
computed target address into nPC. The PC now 
points to the instruction which will be executed next, 
and the nPC points to the instruction which will be 
executed after the next one; in other words, two in¬ 
structions hence. 

Figure 2.8 shows PC and NPC changes during in¬ 
struction execution. Note that the delay instruction is 
executed before the control transfer occurs. 

The PC and NPC registers cannot be directly written 
or read. 

3. Instruction Set 

The Fujitsu MB86901 instruction set is comprised of 
66 basic instructions which are categorized into four 
main functional groups as shown in Table 3.1. 

Mnemonics for the complete MB86901 instruction 
set may be obtained from the above table of basic 
instructions by forming all possible combinations of 
boldface letters. For example, the complete set of 
load instructions is: 

LDSB Load Signed Byte 

LDSBA Load Signed Byte from Alternate space 

LDSH Load Signed Halfword 

LDSHA Load Signed Halfword from Alternate space 

LDUB Load Unsigned Byte 

LDUBA Load Unsigned Byte from Alternate space 

LDUH Load Unsigned Halfword 

LDUHA Load Unsigned Halfword from Alternate space 

LD Load word 

LDA Load word from Alternate space 

LDD Load Doubleword 

LDDA Load Doubleword from Alternate space 


Table 3.1 Instruction Functional Groups 


DATA TRANSFER: 




LoaD 

Signed 

Byte 

normal 



Unsigned 

Haifword 

Alternate 

8 

LoaD 


word 

normal 




Doubleword 

Alternate 

4 

STore 


Byte 

normal 




Halfword 

Alternate 




word 





Doubleword 


8 

atomic SWAP word 





atomic Load-Store Unsigned Byte 2 

ARITHMETIC/LOGICAL/SHIFT: 


ADD 

normal 

normal 


SUB 

extended 

set CC 

8 

AND 

normal 

normal 


OR 

Not 

set CC 



XOR 12 

Shift Left Logical 
Shift Right Logical 

Shift Right Arithmetic 3 

Tagged ADD set CC normal 

SUB set CC Trap overflow 4 

MULtiply Step set CC 

SET High 

SAVE 

RESTORE 4 

CONTROL TRANSFER: 

Branch on Integer Condition Code 

CALL 

Jump and Link 
RETurn from Trap 

Trap on Integer Condition Code 5 

READ/WRITE CONTROL REGISTER: 

ReaD PSR 
WRite TBR 
WIM 

Y 8 

TOTAL NUMBER OF INSTRUCTIONS: W 
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Table 3.2 

Instruction Set 





Instruction 

Mnemonic 

Format 

OP 

OP2 

OPS 

opf 

Cycles 

Absolute Value ** 

FABS 

3c 

2 


34 

009 


Add 

ADD 

3a/3b 

2 


00 


1 

Add and Set Condition Code 

ADDCC 

3a/3b 

2 


10 


1 

Add with Carry 

ADDX 

3a/3b 

2 


08 


1 

Add with Carry and Set Condition Code 

ADDXCC 

3a/3b 

2 


18 


1 

Add Double ** 

FADDD 

3c 

2 


34 

042 


Add Single ** 

FADDS 

3c 

2 


34 

041 


And 

AND 

3a/3b 

3 


01 


1 

And and Set Condition Code 

ANDCC 

3a/3b 

3 


11 


1 

And Not 

ANDN 

3a/3b 

3 


05 


1 

And Not and Set Condition Code 

ANDNCC 

3a/3b 

3 


15 


1 

Atomic Load-Store Unsigned Byte 

Atomic Load-Store Unsigned Byte In 

LDSTUB 

3a/3b 

3 


OD 


4 

Alternate Space * 

LDSTUBA 

3a/3b 

3 


ID 


4 

Branch on Integer Condition Code 

BICC 

2b 

0 

2 



1-2(1) 

Branch on Floating Point Condition Code 

FBFCC 

2b 

0 

6 



1-2(1) 

Call 

CALL 

1 





1 

Compare Double ** 

FCMPD 

3c 

2 


35 

052 


Compare Double and Exception if Unordered ** 

FCMPED 

3c 

2 


35 

056 


Compare Single ** 

FCMPS 

3c 

2 


35 

051 


Compare Single and Exception if Unordered ** 

FCMPES 

3c 

2 


35 

055 


Convert Double to Integer ** 

FDTOI 

3c 

2 


34 

0D2 


Convert Double to Single ** 

FDTOS 

3c 

2 


34 

0C6 


Convert Integer to Double ** 

FITOD 

3c 

2 


34 

0C8 


Convert Integer to Single ** 

FITOS 

3c 

2 


34 

0C4 


Convert Single to Double ** 

FSTOD 

3c 

2 


34 

0C5 


Convert Single to Integer ** 

FSTOI 

3c 

2 


34 

0D1 


Divide Double ** 

FDIVD 

3c 

2 


34 

04E 


Divide Single ** 

FDIVS 

3c 

2 


34 

04D 


Exclusive Nor 

XNOR 

3a/3b 

3 


07 


1 

Exclusive Nor and Set Condition Code 

XNORCC 

3a/3b 

3 


17 


1 

Exclusive Or 

XOR 

3a/3b 

3 


03 


1 

Exclusive Or and Set Condition Code 

XORCC 

3a/3b 

3 


13 


1 

Inclusive Or 

OR 

3a/3b 

3 


02 


1 

Inclusive Or and Set Condition Code 

ORCC 

3a/3b 

3 


12 


1 

inclusive Or Not 

ORN 

3a/3b 

3 


06 


1 

Inclusive Or Not and Set Condition Code 

ORNCC 

3a/3b 

3 


16 


1 

Jump and Link 

JMPL 

3a/3b 

2 


38 


2 

Load Double Word 

LDD 

3a/3b 

3 


03 


3 

Load Double Word From Alternate space * 

LDDA 

3a/3b 

3 


13 


3 

Load Double Floating Point Register 

LDDF 

3a/3b 

3 


23 


3 

Load Floating Point Register 

LDF 

3a/3b 

3 


20 


2 

Load Floating Point Status Register 

LDFSR 

3a/3b 

3 


21 



Load Signed Byte 

LDSB 

3a/3b 

3 


09 


2 

Load Signed Byte From Alternate space * 

LDSBA 

3a/3b 

3 


19 


2 

Load Signed Halfword 

LDSH 

3a/3b 

3 


OA 


2 

Load Signed Halfword From Alternate space * 

LDSHA 

3a/3b 

3 


1A 


2 

Load Unsigned Byte 

LDUB 

3a/3b 

3 


01 


2 

Load Unsigned Byte From Alternate space * 

LDUBA 

3a/3b 

3 


11 


2 

Load Unsigned Halfword 

LDUH 

3a/3b 

3 


02 


2 

Load Unsigned Halfword From Alternate space * 

LDUHA 

3a/3b 

3 


12 


2 

Load Word 

LD 

3a/3b 

3 


00 


2 

Load Word From Alternate space * 

LDA 

3a/3b 

3 


10 


2 
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Table 3.2 Instruction Set (cont.) 


Instruction 

Mnemonic 

Format 

OP 

OP2 

OPS 

opf 

Cycles 

Move ** 

FMOVS 

3c 

2 


34 

001 


Multiply Double ** 

FMULD 

3c 

2 


34 

04A 


Multiply Single ** 

FMULS 

3c 

2 


34 

049 


Multiply Step and Set Condition Code 

MULSCC 

3a/3b 

3 


24 


1 

Negate ** 

FNEGS 

3c 

2 


34 

005 


Read Processor State Register * 

RDPSR 

3(2) 

2 


29 


1 

Read Trap Base Register 

RDTBR 

3(2) 

2 


2B 


1 

Read Window Invalid Mask Register * 

RDWIM 

3(2) 

2 


2A 


1 

Read Y Register 

RDY 

3(2) 

2 


28 


1 

Restore 

RESTORE 

3a/3b 

2 


3D 


1 

Return from Trap * 

RETT 

3a/3b 

2 


39 


2 

Save 

SAVE 

3a/3b 

2 


3C 


1 

Set r Register High 22 bits 

SETHI 

2a 

0 

4 



1 

Shift Left Logical 

SLL 

3a/3b 

3 


25 


1 

Shift Right Arithmetic 

SRA 

3a/3b 

3 


27 


1 

Shift Right Logical 

SRL 

3a/3b 

3 


26 


1 

Store Byte 

STB 

3a/3b 

3 


05 


3 

Store Byte to Alternate space * 

STBA 

3a/3b 

3 


15 


3 

Store Double Floating Point Queue 

STDFQ 

3a/3b 

3 


26 


4 

Store Double Floating Point 

STDF 

3a/3b 

3 


27 


4 

Store Double Word 

STD 

3a/3b 

3 


07 


4 

Store Double Word to Alternate space * 

STDA 

3a/3b 

3 


17 


4 

Store Floating Point Register 

STF 

3a/3b 

3 


24 


3 

Store Floating Point Status Register 

STFSR 

3a/3b 

3 


25 


4 

Store Halfword 

STH 

3a/3b 

3 


06 


3 

Store Halfword to Alternate space * 

STHA 

3a/3b 

3 


16 


3 

Store Word 

ST 

3a/3b 

3 


04 


3 

Store Word to Alternate space * 

STA 

3a/3b 

3 


14 


3 

Subtract 

SUB 

3a/3b 

3 


04 


1 

Subtract and Set Condition Code 

SUBCC 

3a/3b 

3 


14 


1 

Subtract with Carry 

SUBX 

3a/3b 

3 


OC 


1 

Subtract with Carry and Set Condition Code 

SUBXCC 

3a/3b 

3 


1C 


1 

Subtract Double ** 

FSUBD 

3c 

2 


34 

046 


Subtract Single ** 

FSUBS 

3c 

2 


34 

045 


Tagged Add and Set Condition Code 

Tagged Add and Set Condition Code and Trap 

TADDCC 

3a/3b 

3 


20 


1 

on Overflow 

TADDCCTV 

3a/3b 

3 


22 


1 

Tagged Subtract and Set Condition Code 

Tagged Subtract and Set Condition Code and 

TSUBCC 

3a/3b 

3 


21 


1 

Trap on Overflow 

TSUBCCTV 

3a/3b 

3 


23 


1 

Trap on Integer Condition Code 

TICC 

3a/3b 

2 


3A 


1.4(3) 

Write Processor State Register * 

WRPSR 

3a/3b 

2 


31 


1 

Write Trap Base Register * 

WRTBR 

3a/3b 

2 


33 


1 

Write Window Invalid Mask Register * 

WRWIM 

3a/3b 

2 


32 


1 

Write Y Register 

WRY 

3a/3b 

2 


30 


1 


* Privileged Instruction 

** Floating Point Operate Instruction 

Notes: 

(1) One cycle If branch taken; two cycles if branch not taken. 

(2) Fields other than op, rd, and op3 are Ignored. 

(3) Four cycles If trap taken; one cycle if trap not taken. 
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3.1. Data Transfer Instructions 

The Load and Store instructions are the only in¬ 
structions which access main memory, using two lU 
registers (Format 3a) or an IU register and a signed 
immediate value (Format 3b) to calculate the 32-bit 
byte address in memory. In addition to the address, 
the processor always generates an address space 
identifier, or asi (see Section 2.4). The instruction’s 
destination field specifies either an ID register, an 
FPU register, or a coprocessor register to supply the 
data for a store or receive the data from a load. I/O 
device registers are accessed via load/store instruc¬ 
tions. 


Mnemonic 

op 

op3 

operation 

LDSB 

11 

001001 

Load Signed Byte 

LDSBA 

11 

011001 

Load Signed Byte from 
Alternate space 

LDSH 

11 

001010 

Load Signed Halfword 

LDSHA 

11 

011010 

Load Signed Halfword 
from Alternate space 

LDUB 

11 

000001 

Load Unsigned Byte 

LDUBA 

11 

010001 

Load Unsigned Byte 
from Alternate space 

LDUH 

11 

000010 

Load Unsigned 
Halfword 

LDUHA 

11 

010010 

Load Unsigned 
Halfword from 
Alternate space 

LD 

11 

000000 

Load word 

LDA 

11 

010000 

Load word from 
Alternate space 

LDD 

11 

000011 

Load Doubleword 

LDDA 

11 

010011 

Load Doubleword 
from Alternate space 

STB 

11 

000101 

Store Byte 

STBA 

11 

010101 

Store Byte Into 
Alternate space 

STH 

11 

000110 

Store Halfword 

STHA 

11 

010110 

Store Halfword Into 
Alternate space 

ST 

11 

00100 

Store Word 

STA 

11 

010100 

Store Word Into 
Alternate space 

STD 

11 

000111 

Store Doubleword 

STDA 

11 

010111 

Store Doubleword 
into Alternate space 


Figure 3.1 Instruction Formats 



The effective address for a load or store instruction 
is either “ttrsi] = r[rs2]” if the I field is zero, or 
“r[rs1] + simm13” if the I field is one. Instructions 
which load from or store to an alternate address 
space must have zero in the I field and the address 
space identifier to be used in the asi field. Other¬ 
wise, the address space indicated is either a user or 
system data space, according to the S bit of the 
PSR (see Section 2.5.2.1). 

Note that the load/store alternate instructions are 
privileged; they can only be executed in the supervi¬ 
sor mode. 


UL 

rd 

op3 

rs1 

bO 

asi 

rs2 1 

31 30 

29 

25 

24 


19 

18 


14 

13 

12 5 

4 0 






Format 3a 




QI] 

rd 

op3 

rs1 

bl 

simm13 | 

31 30 

29 

25 

24 


19 

18 


14 

13 

12 

0 


Format 3b 
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3.1.1. Multiprocessor Support instructions 

The special instruction “atomic load and store un¬ 
signed byte” (LDSTUB) supports tightly coupled 
multiprocessors. The LDSTUB instruction reads a 
byte from memory into an lU register and the re¬ 
writes the same byte in memory to all ones, while 
also precluding intervening accesses on the 
memory or I/O bus. The LDSTUB instruction may be 
used to construct semaphores. It has byte, rather 
than word, addressing because word-wide registers 
may not be aligned at word addresses in general 
purpose I/O buses. 


Mnemonic 

op 

op3 

operation 

LDSTUB 

11 

001101 

Atomic Load-Store 




Unsigned Byte 

LDSTUBA 

11 

011101 

Atomic Load-Store 


Unsigned Byte into 
Alternate space 


nn 

rd 

op3 

rsl 

i=0 

asi rs2 | 

31 30 29 25 

24 19 18 14 13 

Format 3a 

12 5 4 0 


rd 

op3 

1 rsl 


simm13 | 


31 30 29 23 24 19 18 14 13 12 0 


Format 3b 


These instructions move a byte from memory into 
the r register specified by the rd field, rewriting the 
same byte to memory to all ones without allowing in¬ 
tervening asynchronous traps. In a multiprocessor 
system, two or more processors executing atomic 
load-store instructions addressing the same byte 
simultaneously are guaranteed to execute them in 
some serial order. 

The effective address for a load or store instruction 
is either “r[rsl] = r[rs2]” if the I field is zero, or 
“r[rs1] + slmm13” if the i field is one. LDSTUBA 
must have zero in the i field, or an illegal instruction 
trap occurs. The address space identifier used for 
the memory accesses is taken from the asi field. 
For LDSTUB, the address space indicates either a 
user or system data space access, according to the 
S bit in the PSR. 


If an atomic load-store instruction traps, memory re¬ 
mains unchanged. However, an implementation may 
cause a “data access exception” trap during the 
store memory access, but not during the load 
access. In this case, the destination register can be 
changed. 


3.2. Arithmetic/Logical/Shift Instructions 

The Arithmetic/Logical/Shift instructions, with the ex¬ 
ception of SETHI, compute a result that is a function 
of two source operands, and either write the result 
into a destination register, r[rd], or discard it. One of 
the operands is always register r[rs1]. The other 
operand depends on the i bit in the instruction for¬ 
mat: if I = 0, the operand is register r[rs2], but if I = 
1, the operand is a 13-bit sign-extended constant, 
simm 13. 


3.2.1. Add and Subtract 


Mnemonic 

op 

op3 

operation 

ADD 

10 

000000 

Add 

ADDcc 

10 

010000 

Add and modify icc 

ADDX 

10 

001000 

Add with carry 

ADDXcc 

10 

011000 

Add with carry and 
modify icc 

SUB 

10 

000100 

Subtract 

SUBcc 

10 

010100 

Subtract and modify icc 

SUBX 

10 

001100 

Subtract with carry 

SUBXcc 

10 

011100 

Subtract with carry 
and modify icc 


E 

rd 

op3 

rsl 

isO 

ignored 


31 30 

29 25 

24 19 18 14 

Format 3a 

13 

12 5 4 0 

H 

rd 

op3 

rsl 

[m] 

simm13 | 


31 30 29 2S 24 19 18 14 13 12 0 


Format 3b 
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ADD and ADDcc (SUB and SUBcc) compute either 
“r[rs1] + (-) r[rs2]” if I = 0, or “r[rs1] + (-) simml3” if 
i = 1, and place the result in the r register specified 
in the rd field. 

ADDX and ADDXcc (SUBX and SUBXcc) add (sub¬ 
tract) the PSR’s carry bit c also; that is, they 
compute “r[rs1] + (-) r[rs2] + (-) c” or “rfrslj + (-) 
simm13 + (-) c”, if i = 0 or i = 1, respectively, and 
place the result in the r register specified in the rd 
field. 

ADDcc and ADDXcc (SUBcc and SUBXcc) modify 
all the integer condition codes. 


3.2.2. Tagged Add and Subtract 


Mnemonic 

op 

op3 

operation 

TADDcc 

10 

100000 

Tagged Add and 
modify icc 

TADDccTV 

10 

100010 

Tagged Add, modify 
icc, Trap on Overflow 

TSUBcc 

10 

100001 

Tagged Subtract, and 
modify icc 

TSUBccTV 

10 

100011 

Tagged Subtract, 
modify icc. Trap on 
Overflow 


QT 

rd 

op3 

rsl 

i=0 

ignored 

rs2 1 

31 30 

29 

25 

24 


19 

18 14 

13 

12 5 

4 0 






Format 3a 




m 

rd 

op3 

rsl 

151 

simm13 | 

31 30 

29 

25 

24 


19 

18 14 

13 

12 

0 


Format 3b 


Tagged add and subtract instructions operate on 
tagged data where the tag is the least significant 
two bits of data. TADDcc and TADDccTV (TSUBcc 
and TSUBccTV) compute either “r[rs1] + (-) r[rs2]” 
if i = 0, or “r[rs1] + (-) simm13” if i = 1, and place 
the result in the r register specified in the rd field. If 
either of the source operands has a nonzero tag, 
the overflow bit, v, of the PSR is set. This bit is also 
set if the addition (subtraction) generates an arith¬ 
metic overflow. 


If a TADDccTV (TSUBccTV) results in an overflow 
condition, a ‘lag overflow” trap is generated and the 
destination register and condition codes remain un¬ 
changed. If a TADDccTV (TSUBccTV) does not 
result in an overflow condition, all the integer condi¬ 
tion codes are updated (in particular, the overflow v 
bit is set to 0), and the result of the addition (sub¬ 
traction) is written into the r register specified by the 
rd field. 

If a TADDcc (TSUBcc) results in an overflow condi¬ 
tion, the overflow bit v of the PSR is set; if the 
instruction does not result in an overflow, the bit is 
cleared. In either case, the remaining integer condi¬ 
tion codes are also updated and the result of the 
addition (subtraction) is written into the r register 
specified by the rd field. 

The tagged arithmetic instructions can be used by 
languages such as Lisp, Smalltalk, and Prolog 
which benefit from tags. Normally, a tagged 
add/subtract instruction is followed by a conditional 
branch, which, if the overflow bit has been set, 
transfers control to code which further deciphers the 
operand types. The two variants, TADDccTV and 
TSUBccTV, which trap when the overflow bit has 
been set, can be used for error checking when the 
compiler knows the operand types. 

3.2.3. Multiply Step Instruction 

Mnemonic op op3 operation 

MULScc 10 100100 Multiply Step 

and modify icc 


m 

rd 

op3 

1 

IBl 

ignored rs2 | 

31 30 

29 25 

24 19 18 14 

Format 3a 

13 12 5 4 0 

IjoJ 

rd 

op3 

rsl 

i=1 

simm13 | 


31 30 29 25 24 19 18 14 13 12 0 


Format 3b 
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The multiply step instruction can be used to 
generate the 64-bit product of two signed or un¬ 
signed words. In the following description, the 
incoming partial product is located in register r[rs1], 
and the multipiicand is either the contents of register 
r[rs2] (if i = 0) or the sign extended simmIS (if i = 
1). The instruction operates as follows; 

(1) The value obtained by shifting the incoming 
partial product right by one bit and replacing its 
high-order bit by n XOR v (the sign of the pre¬ 
vious partial product—see Section 2.5.2.1 
Integer Condition Codes) is computed. 

(2) If the least significant bit of the Y register (the 
multiplier) is set, the value from step (1) is added 
to the multiplicand. If the LSB of the Y register is 
not set, then zero is added to the value from step 
( 1 ). 

(3) The result from step (2), the outgoing partial 
product, is written into r[rd]. The integer condition 
codes in the Processor State Register are up¬ 
dated according to the addition performed in step 
( 2 ). 

(4) The Y register (the multiplier) is shifted right 
by one bit, and its high-order bit is replaced by 
the least significant bit of the incoming partial pro¬ 
duct. 


3.2.4. Logical Instructions 


Mnemonic 

op 

op3 

AND 

10 

000001 

ANDcc 

10 

010001 

ANDN 

10 

000101 

ANDNcc 

10 

010101 

OR 

10 

000010 

ORcc 

10 

010010 

ORN 

10 

000110 

ORNcc 

10 

010110 

XOR 

10 

000011 

XORcc 

10 

010011 

XNOR 

10 

000111 

XNORcc 

10 

010111 


operation 

And 

And and modify icc 
And Not 

And Not and modify icc 
Inclusive Or 
Inclusive Or and 
modify icc 
Inclusive Or Not 
Inclusive Or Not 
and modify icc 
Exclusive Or 
Exclusive Or 
and modify icc 
Exclusive Nor 
Exclusive Nor 
and modify icc 


Ho" 

rd 

op3 

rs1 

l=0 

Ignored 

rs2 I 

31 30 29 25 24 19 18 14 13 12 5 4 0 


Format 3a 


MO 

rd 

op3 

rs1 

i=1 

simm13 | 

31 30 

29 25 

24 19 

16 14 

13 

12 0 


Format 3b 


These instructions implement the bitwise logical 
operations. With op denoting the generic logical 
operation, these instructions compute either r[rs1] 
op r[rs2] (if I = 0) or r[rs1] op simm13 (if i = 1). 

ANDcc, ANDNcc, ORcc, ORNcc, XORcc, and 
XNORcc modify all the integer condition codes as 
described in Section 2.5.2.1. 


3.2.5. Shift Instructions 


Mnemonic 

op 

op3 

operation 

SLL 

10 

100101 

Shift Left Logical 

SRL 

10 

100110 

Shift Right Logical 

SRA 

10 

loom 

Shift Right Arithmetic 


10 


rd 


op3 


rsl 


isO 


ignored 


rs2 


14 13 12 


Format 3a 


MO 

rd 

op3 

rs1 

1=1 

simm13 | 

31 30 

29 25 

24 19 

18 14 

13 

12 0 


Format 3b 


The shift count for these instructions is either the 
least significant five bits of r[rs2] (if i = 0) or the 
least significant five bits of simm13 (if 1 = 1). 

SLL shifts the value of r[rs1] left by the number of 
bits implied by the shift count. 

SRL and SRA shift the value of r[rs1] right by the 
number of bits implied by the shift count. 

SLL and SRL replace vacated positions with zeroes, 
whereas SRA fills vacated positions with the most 
significant bit of r[rs1I. No shift occurs when the 
shift count is zero. 
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All Of these instructions place the shifted result in 
r[rd]. These instructions do not modify the integer 
condition codes. 

3.2.6. Save and Restore Instructions 

Mnemonic op op3 operation 

SAVE 10 111100 Save caller’s window 

RESTORE 10 111101 Restore caller’s 

window 



1 rd 1 

op3 

rsl 

i=0 

ignored 

1 1 

31 30 

29 25 

24 19 18 14 

Format 3a 

13 

12 5 4 0 

Ljo 

rd 

op3 

1 1 

s 

simm13 | 


31 30 28 25 24 19 18 14 13 12 0 


Format 3b 


The SAVE instruction subtracts one from the CWP 
(modulo 7) and compares this value, the “new 
CWP”, against the Window Invalid Mask (WIM) reg¬ 
ister. if the WiM bit corresponding to the “new CWP” 
is set, then a window overfiow trap is generated. If 
the corresponding WIM bit is reset, then a window 
overflow trap is not generated and “new CWP” is 
written into CWP. This causes the active window to 
become the previous window, thereby saving the 
caller’s window. 

The RESTORE instruction adds one to the CWP 
(modulo 7) and compares this value, the “new 
CWP”, against the Window Invalid Mask (WiM) reg¬ 
ister. If the WIM bit corresponding to the “new CWP” 
is set, then a window underflow trap is generated. If 
the corresponding WiM bit is reset, then a window 
underflow trap is not generated and “new CWP” is 
written into CWP. This causes the previous window 
to become the active window, thereby restoring the 
caller’s window. 

Furthermore, if an overfiow or underfiow trap is not 
generated, SAVE and RESTORE behave iike nor¬ 
mal ADD instructions, except that the operands 
r[rsl] or r[rs2] are read from the old window (i.e., 
the window addressed by the original CWP) and the 
result is written into r[rd] of the new window (i.e., 
the window addressed by the new CWP). 

Note that CWP arithmetic is performed modulo 7. 


3.2.7. SETHI Instruction 

Mnemonic op op2 operation 

SETHI 00 100 Set High 


o 

o 

CL 


imm22 

1 

31 30 29 

25 24 22 21 


0 


Format 2a 


The SETHI instruction is a speciai instruction that 
can be used in combination with a standard arith¬ 
metic instruction to construct a 32-bit constant in 
two instructions, it loads a 22-bit immediate value, 
imm22, into the high order bits of the destination 
register r[rd] and zeroes the least significant 10 bits. 
The integer condition codes are not affected. 
Another instruction is used to ioad the low 10 bits. 
Because other arithmetic/logical instructions have a 
13-bit signed immediate value, the 22-bit SETHI 
value implies an overiap of 2 bits in the result. In 
combination with a ioad or store instruction, SETHI 
can also be used to construct a 32-bit load/store 
address. The SETHI instruction uses Format 2a, 
shown. 

It is suggested that sethi 0, %0, where %0 stands 
for global register 0, be used as the preferred NOP, 
since it will not cause an increase in execution time 
if it follows a load instruction. 

3.3. Control Transfer Instructions 

Control transfer instructions change the values of 
PC and nPC. There are five types of controi transfer 
instructions: 

(1) Conditionai branch (Bicc) 

(2) Call (CALL) 

(3) Jump and Link (JMPL) 

(4) Trap (Ticc) 

(5) Return from trap (RETT) 

Each of these instructions can be further catego¬ 
rized according to whether it is 1) PC (program 
counter)-reiative or register-indirect, or 2) deiayed or 
non-delayed. A PC-relative control transfer com¬ 
putes its target address by adding the (shifted) 
sign-extended immediate displacement to the pro¬ 
gram counter. A register-indirect instruction com¬ 
putes its target address as either r[rs1] + r[rs2] (if I 
= 0), or r[rs1] + simm13 (if I = 1). A controi transfer 
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instaiction is delayed if it transfers control to the tar- In every case, if a = 0, the delay instruction is ex- 
get after a one-instruction delay. In summary: ecuted. 


Instruction Characteristic Delayed 

Bicc, CALL PC-relative Yes 

JMPL, RETT register-indirect Yes 

Ticc register-indirect No 


The rationale behind the use of delayed control 
transfers is explained in Section 2.1.5 under the dis¬ 
cussion of the MB86901 pipeline. The mechanics of 
the operation is explained in Section 2.5.3 Program 
Counters. 

3.3.1. Branch on Integer Condition Instructions 

Mnemonic op op2 operation 

Bicc 00 010 See Table 3.3 


1 00 

I 

cond 

010 

disp22 1 

31 30 

Ml 

28 2S 

24 22 

21 0 


Format 2b 


Execution of Delay Instruction 



a = 0 

a.^1 

Conditional/Taken 

YES 

YES 

Conditional/Not Taken 

YES 

NO 

BA 

YES 

NO 

BN 

YES 

NO 


Note that the result for BA with a = 1 is the reverse 
of what might be expected. The delay instruction is 
NOT executed even though the branch IS taken. 

An annulled instruction has no effect on the state of 
the lU, nor can any trap occur during an annulled in¬ 
struction. 

3.3.2. CALL Instruction 

Mnemonic op operation 

CALL 01 Call 


A Bicc instruction (except BA and BN) evaluates the 
integer condition codes (Icc) of the Processor State 
Register (PSR) according to the cond field. If the 
condition codes evaluate to true, the branch is 
taken, and the instruction causes a PC-relative, 
delayed control transfer to the address “PC + 
(4*sign_ext(dlsp22))”. If the condition codes eval¬ 
uate to false, the branch Is not taken. 

The annul bit a changes the behavior of the delay 
Instruction. This bit is only available on conditional 
branch instructions (Bicc). The effect of the annul bit 
may be summarized as follows: 

If a = 1, and the branch in a conditional branch is 
taken, then the delay instruction which follows is 
executed. 

If a = 1, and the branch is not taken, then the 
delay instruction is annuiled (not executed). 

Branch Never (BN) acts like a NOP except that if 
a = 1, the delay instruction is annulled (not ex¬ 
ecuted). 

Branch Always (BA) causes transfer of control re¬ 
gardless of the value of the condition code bits. If 
a = 1, the delay instruction is annulled (not ex¬ 
ecuted). 


01 


dispSO 


31 30 29 


0 


Format 1 


A procedure that requires a register window is in¬ 
voked by executing both a CALL and a SAVE (see 
Section 3.2.6) instruction. A procedure which does 
not need a register window, a so-called “leaf” 
routine, is invoked by executing only a CALL instruc¬ 
tion. Leaf routines can use only the out registers. 

The CALL instruction stores the PC, which points to 
the CALL itself, into out register r[15]. The CALL in¬ 
struction then causes an unconditional, delayed, 
PC-relative control transfer to address “PC + (4 * 
disp30)”. The PC-relative displacement is formed by 
appending two low order zeroes to the instruction’s 
30-bit word displacement field. Since the word dis¬ 
placement (disp30) field is 30 bits wide, the target 
address can be arbitrarily distant. 

A JMPL instruction (see Section 3.3.3) with rd = 15 
can be used as a register indirect CALL. 
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Table 3.3 integer Branch Conditions 


Mnemonic 

cond 

Operation 

icc Test 

BA 

1000 

Branch Always 

1 

BN 

0000 

Branch Never 

0 

BNE 

1001 

Branch on Not Equal 

not z 

BE 

0001 

Branch on Equal 

z 

BG 

1010 

Branch on Greater 

not (z or (n xor V)) 

BLE 

0010 

Branch on Less or Equal 

z or (n xor v) 

BGE 

1011 

Branch on Greater or Equal 

not (n xor v) 

BL 

0011 

Branch on Less 

n xor V 

BGU 

1100 

Branch on Greater Unsigned 

not (c or z) 

BLEU 

0100 

Branch on Less or Equal Unsigned 

(c or z) 

BCC 

1101 

Branch on Carry Clear * 

not c 

BCS 

0101 

Branch on Carry Set ** 

c 

BROS 

1110 

Branch on Positive 

not n 

BNEG 

0110 

Branch on Negative 

n 

BVC 

1111 

Branch on Overflow Clear 

not V 

BVS 

0111 

Branch on Overflow Set 

V 


* Greater Than or Equal, Unsigned 
** Less Than, Unsigned 


Table 3.4 Integer Trap Conditions 


Mnemonic 

cond 

Operation 

icc Test 

TA 

1000 

Trap Always 

1 

TN 

0000 

Trap Never 

0 

TNE 

1001 

Trap on Not Equal 

not z 

TE 

0001 

Trap on Equal 

z 

TG 

1010 

Trap on Greater 

not (z or (n xor v)) 

TLE 

0010 

Trap on Less or Equal 

z or (n xor v) 

TGE 

1011 

Trap on Greater or Equal 

not (n xor v) 

TL 

0011 

Trap on Less 

n xor V 

TGU 

1100 

Trap on Greater Unsigned 

not (c or z) 

TLEU 

0100 

Trap on Less or Equal Unsigned 

(c or z) 

TCC 

1101 

Trap on Carry Clear * 

not c 

TCS 

0101 

Trap on Carry Set ** 

c 

TPOS 

1110 

Trap on Positive 

not n 

TNEG 

0110 

Trap on Negative 

n 

TVC 

1111 

Trap on Overflow Clear 

not V 

TVS 

0111 

Trap on Overflow Set 

V 


* Greater Than or Equal, Unsigned 
** Less Than, Unsigned 
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3.3.3. Jump and Link Instruction 

Mnemonic op op3 operation 

JMPL 10 111000 JumpandLink 


MoJ 

rd 

op3 

rs1 


ignored 

1 rs2 i 

31 30 29 25 

24 19 18 14 13 

Format 3a 

12 5 4 0 

nr 

rd 

op3 

rsl 


simm13 | 


31 30 29 25 24 19 18 14 13 12 0 


Format 3b 


A procedure which uses a register window returns 
by executing both a RESTORE (see Section 3.2.6) 
and a JMPL instruction. A ieaf procedure returns by 
executing a JMPL only. The JMPL instruction typi- 
caiiy returns to the instruction foliowing the CALL’S 
delay instruction. In other words, the typical return 
address is 8 plus the address saved by the CALL. 

The JMPL instruction causes a register indirect con¬ 
trol transfer to an address specified by either r[rs1] 
+ r[rs2] (if i = 0), or r[rs1] + simm13 (if I = 1). The 
JMPL instruction writes the PC, which contains the 
address of the JMPL instruction, into the destination 
register specified in the rd field. If either of the two 
low order bits of the jump address is nonzero, a 
“mem address not aligned” trap occurs. 

3.3.4. Trap on Integer Condition Codes 

Mnemonic op op3 operation 

Ticc 10 111010 See Table 3.4 


UL 

rd 

op3 

rsl 

[E 

ignored 

rs2 1 

31 30 

29 25 24 19 18 14 

Format 3a 

13 

12 5 4 0 

QI] 

rd 

op3 

rsl 1 

i=1 

simm13 | 


31 30 29 25 24 19 18 14 13 12 0 


Format 3b 


A Ticc instruction evaluates the integer condition 
codes (icc) of the Processor State Register accord¬ 
ing to the cond field, if the condition codes evaluate 
to true, and there are no higher priority traps pend¬ 
ing, then a “trap instruction” trap is generated with 


no delay instruction. If the condition codes evaluate 
to false, it executes as a NOP. 

If a trap is generated, the tt field of the Trap Base 
Register (TBR) is written with 128 plus the least sig¬ 
nificant seven bits of either r[rs1] + rlrs2] (if i = 0), 
or r[rs1] + simm13 (if i = 1). The processor enters 
the supervisor mode, disables traps, decrements 
the CWP, and saves PC and nPC into local registers 
r[17] and r[18] (respectively) of the new window. 
Ticc can be used to implement kernel calls, break¬ 
pointing, and tracing. It can also be used for 
run-time checks, such as out of range indices, in¬ 
teger overflow, etc. 

See Section 5 on Traps, Exceptions, and Error 
Handling for a more complete discussion of traps. 

3.3.5. Return from Trap Instruction 

Mnemonic op op3 operation 

RETT 10 111001 Return from Trap 

The RETT instruction uses either Format 3a or For¬ 
mat 3b, as described in Section 3.3.4. The RETT 
instruction adds one to the CWP (modulo 7) and 
compares this value, the “new CWP”, against the 
Window Invalid Mask (WIM) register. If the WIM bit 
corresponding to the “new CWP” is set, then a win¬ 
dow underflow trap is generated. If the 
corresponding WIM bit is reset, then a window un¬ 
derflow trap is not generated and “new CWP” is 
written into CWP. This causes the previous window 
to become the active window, thereby restoring the 
window that existed at the time of the trap. 

If a window underflow trap is not generated, RETT 
causes a delayed control transfer to the target 
address. The target address is either r[rs1] + r[rs2] 
(if I = 0), or r[rs1] + simm13 (if I = 1). Furthermore, 
RETT restores the S field of the PSR from the PS 
field, and sets the ET field to one (see Section 
2.5.2.3). 

If traps are enabled (ET = 1), an illegal instruction 
trap occurs. If traps are disabled (ET = 0), and the 
processor is not in supervisor mode (S = 0), or if a 
window underflow condition is detected, or if either 
of the two low order bits of the target address is 
nonzero, a reset trap occurs. If a reset trap occurs, 
the tt field of the TBR encodes the trap condition: 
privileged instruction, window overflow, or memory 
address not aligned. 
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The instruction executed immediately before a 
RETT must be a JMPL instruction. If it is not, the lo¬ 
cation where execution continues is not necessarily 
within the address space implied by the PS bit of the 
PSR. Trap handlers complete execution by execut¬ 
ing the “JMPL, RETT” couple. 

3.4. Read/Write Control Register Instructions 

These instructions read or write the contents of the 
programmer visible control registers. This category 
includes instructions to read and write the PSR, the 
WIM, the TBR, the Y register (see Section 2.5.2), as 
well as the Floating point State Register (FSR) and 
the Coprocessor State Register (CSR). These in¬ 
structions are all privileged (available in supervisor 
mode only), except those that read and write the Y 
register, the FSR, and the CSR. 


3.4.1. Read State Register Instructions 


Mnemonic 

op 

op3 

operation 

RDY 

10 

101000 

Read Y register 

RDPSR 

10 

101001 

Read Processor State 
Register 

RDWIM 

10 

101010 

Read Window Invalid 
Mask register 

RDTBR 

10 

101011 

Read Trap Base 
Register 


Ul 


op3 

ignored | 


31 30 29 25 24 19 18 

Format 3a 


These instructions read the specified lU state regis¬ 
ters into the r register specified in the rd field 


OE 

ignored 

op3 

rs1 

i=1 

simm13 | 

31 30 

29 25 

24 19 

18 14 

13 

12 0 


Format 3b 


These instructions write either r[rs1] XOR r[rs2] (if I 
= 0), or r[rs1] XOR simm13 (if I = 1), to those sub¬ 
fields of the specified lU state register which may be 
written. WRPSR does not write the PSR and causes 
an illegal instruction trap if the result would cause 
the CWP field of the PSR to point to an unimple¬ 
mented window. 

These instructions are delayed-write instructions: 

(1) If any of the three instructions after a WRPSR 
uses any field of the PSR that is changed by the 
WRPSR, the value of that field is unpredictable. 
(Note that any instruction which references a 
non-global register implicitly uses the CWP.) 

(2) If a WRPSR instruction is updating the PSR’s 
PIL to a new value and is simultaneously setting 
ET to 1, this can result in an interrupt trap at a 
level equal to the old value of the PIL. Two 
WRPSR instructions should be used when ena¬ 
bling traps and changing the value of the PIL. 
The first WRPSR should specify ET = 0 with the 
new PIL value, and the second WRPSR should 
specify ET = 1 and the new PIL value. 

(3) If any of the three instructions after a WRWIM 
is a SAVE, RESTORE, or RETT, the occurrence 
of window overflow and window underflow traps 
is unpredictable. 


3.4.2. Write State Register instructions 


Mnemonic 

op 

op3 

WRY 

10 

110000 

WRPSR 

10 

110001 

WRWIM 

10 

110010 

WRTBR 

10 

110011 


operation 
Write Y register 
Write Processor 
State Register 
Write Window Invalid 
Mask register 
Write Trap Base 
Register 


Ql 

ignored 

op3 

rs1 

i=0 

ignored 

rs2 I 


31 30 29 25 24 19 18 14 13 12 5 4 0 


(4) If any of the three instructions which follow a 
WRY is a MULScc or RDY, the value of Y used is 
unpredictable. 

(5) If any of the three instructions which follow a 
WRTBR causes a trap, the trap base address 
(TBA) used may be either the old or the new 
value. 

(6) If any of the three instructions after a write 
state register instruction reads the modified state 
register, the value read is unpredictable. 


Format 3a (7) If any Of the three instructions after a write 

state register instruction is trapped, a subsequent 
read state register instruction in the trap handier 
will get the register’s new value. 
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4. Floating Point Operations 

The MB86901 executes all SPARC instructions ex¬ 
cept for Floating Point Operate (FPop) instructions. 
All the floating point operations are actually dis¬ 
patched through the F<31:0> bus to the MB86911 
Floating Point Controller (FPC). Here, they enter a 
First In First Out (FIFO) floating point queue to await 
actual floating point processing done by the Tl 
SN74ACT8847 Floating Point Processor (FPP). In 
addition to the floating point queue, the MB86911 
also includes the data register file, status register, 
and control circuitry for mastering floating point ex¬ 
ceptions. The FPC and the FPP are collectively re¬ 
ferred to as the Floating Point Unit (FPU). 

The lU and the FPU operate concurrently, assuming 
that the floating point queue remains in a non-over- 
flow status. If the FPU encounters a floating point 
operate instruction that doesn’t fit in the queue, the 
lU stalls untils the required FPU resource becomes 
available. The architecture hides floating point con¬ 
currency from the programmer by means of appro¬ 
priate register interlocks. Hence, a program in¬ 
cluding floating point computations will generate the 
same results as if all instructions were executed 
sequentially. 

The “F-bus” is used only to transfer floating point in¬ 
structions and associated instruction Program 
Counter (PC) values to the FPC. Any data transfer 
to or from the FPC is done through the FPC memory 
data bus. Floating point load/store instructions are 
used to facilitate this scheme. The MB86901 lU 
generates a memory address and the FPU either 
sources or sinks the data. 

Note that floating point loads and stores are not 
Floating Point operate (FPop) instructions. 

The MB86901 Floating Point Interface includes 
floating point condition code, trap reporting, and 
handshake signals, to maintain coordination be¬ 
tween the lU and the FPU. This interface is dis¬ 
cussed further in Section 7.2. The MB86911 and 
Tl SN74ACT8847 data sheets should be consulted 
for a complete discussion of these chips. 

5. Trap and Exception Handling 

The MB86901 provides three types of traps: syn¬ 
chronous, floating point, and asynchronous. A trap is 
like a random procedure call which, as a side effect, 
causes the processor to enter the supervisor state. 


Synchronous traps are caused by an lU instruction 
or a floating point load/store instruction, and occur 
before the instruction is completed. Floating point 
traps are caused by a Floating Point Operate 
(FPop) instruction and occur before the instruction 
is completed. However, due to the concurrent opera¬ 
tion of the lU and the FPU, other non-floating point 
instructions may have executed in the meantime. 
Asynchronous traps (also called interrupts) occur 
when an external event interrupts the processor. 
They are not related to any particular instruction, 
and occur between the execution of instructions. 

5.1. Synchronous Traps 

A Synchronous trap is defined to be a trap that oc¬ 
curs during the course of an instruction execution, 
prior to the alteration of any processor or system 
state visible to the programmer. Instructions which 
access memory twice (double loads and stores and 
atomic instructions) are the only exceptions. If multi¬ 
ple traps occur during one instruction, the highest 
priority trap is taken. Lower priority traps are ignored 
because the traps are arranged under the assump¬ 
tion that the lower priority traps persist, recur, or are 
meaningless due to the presence of the higher prior¬ 
ity trap. 

The ET bit in the PSR (see Section 2.5.2.1) must be 
set for synchronous traps to occur normally. Should 
a synchronous trap occur while traps are disabled, 
the processor halts and enters an error state. A syn¬ 
chronous trap may originate due to either internal or 
external events. 

An example of an internal event is an attempt to ex¬ 
ecute an illegal instruction or to execute a privileged 
instruction while the processor is in the User mode. 
External events include asserting the MEXC 
(Memory Exception) line during a given active bus 
cycle, thus initiating an instruction access or data 
access. 

5.2. Floating Point Traps 

All floating point instructions are dispatched from 
the lU to a floating point instruction queue, in the 
FPU, where they execute concurrently with any non¬ 
floating point lU operations. It is therefore typical 
that a floating point exception occur sometime after 
the dispatch of the floating point instruction that 
caused the exception. However, a floating point ex¬ 
ception trap is not taken until another floating point 
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instruction has been encountered in the ID instruc¬ 
tion stream. 

When the FPU recognizes an exception condition, it 
enters an “exception pending” state and signals the 
lU through assertion of the “FEXC” signal. It re¬ 
mains in this state until the lU takes the floating 
point exception trap, whereafter it enters the “excep¬ 
tion mode" state. To exit this state, the floating point 
queue must be emptied by one or more “STDFQ” in¬ 
structions. 

The PC (Program Counter) that is saved during a 
floating point exception trap always points to a float¬ 
ing point instruction. However, the exception itself is 
always due to a previously executed floating point 
instruction. This instruction and the value of the PC 
from which it was fetched are in the floating point 
queue. 

5.3. Asynchronous Traps 

All asynchronous traps are signalled to the proces¬ 
sor through the IRL<3:0> (Interrupt Request Level) 
bus. They are divided up into two categories: the 
maskable interrupts, and the Non-Maskable Inter¬ 
rupt (NMI). 

The maskable interrupts must meet two critera for 
the processor to initiate an asynchronous trap pro¬ 
cedure. First, the ET (enable trap) bit must be set in 
the PSR. Second, the IRL must be greater than the 
value in the Processor Interrupt Level (PIL) field of 
the PSR. An IRL<3:0> = 0000 value is the standby 
mode, and will not result in any action. 

The Non-Maskable Interrupt is signalled by IRL<3:0> 
= 1111, and can only be disabled by turning off ET. 
Any asynchronous interrupts have a lower priority 
when arbitrated with pending synchronous traps. 

5.4. Trap Addressing 

The Trap Base Register (TBR) generates the exact 
address of a trap handling routine. When a trap oc¬ 
curs, the hardware writes a value into the Trap Type 
(tt) field of the TBR. This uniquely identifies the trap 
and serves as an offset into the table whose starting 
address is given by the TBA field of the TBR. The 8- 
bit wide tt field allows for 256 distinct types of traps. 
The lower half is used by hardware initiated traps 
while the upper half is dedicated to programmer in¬ 
itiated traps (see Ticc instruction. Section 3.3.4). 


The MB86901 currently utilizes 26 out of the 128 

hardware trap types. 

5.5. Trap Priorities 

Table 5.1 shows the trap types, priorities, and as¬ 
signments. Note that ‘It” is not affected by the reset 

trap. 

5.6. Trap Processing 

A trap causes the following activities: 

• It disables traps (ET = 0) 

• It copies the S field of the PSR into the PS field 
and then sets the S field to 1. 

• It decrements the CWP by 1, modulo the number 
of implemented windows. 

• It saves the PC and nPC into r[17] and r[18], re¬ 
spectively, of the new window. 

• it sets the ‘It” field of the TBR to the appropriate 
value 


Table 5.1 Trap Priorities and tt Assignments 


Trap 

Priority 

tt 

Reset 

1 

— 

Instruction Access Exception 

2 

1 

illegal instruction 

3 

2 

Privileged Instruction 

4 

3 

Floating Point Disabled 

5 

4 

Window Overflow 

6 

5 

Window Underflow 

7 

6 

Memory Address Not Aligned 

8 

7 

Floating Point Exception 

9 

8 

Data Access Exception 

10 

9 

Tag Overflow 

11 

10 

Trap Instruction 

12 

128-255 

Interrupt Level 15 

13 

31 

Interrupt Level 14 

14 

30 

Interrupt Level 13 

15 

29 

Interrupt Level 12 

16 

28 

Interrupt Level 11 

17 

27 

Interrupt Level 10 

18 

26 

Interrupt Level 9 

19 

25 

Interrupt Level 8 

20 

24 

Interrupt Level 7 ! 

21 

23 

Interrupt Level 6 

22 

22 

Interrupt Level 5 

23 

21 

Interrupt Level 4 

24 1 

20 

Interrupt Level 3 

25 1 

19 

Interrupt Level 2 

26 1 

18 

Interrupt Level 1 

27 

17 
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• If the trap is not a reset, it writes the PC with the 
contents of TBR, and the nPC with the contents 
of TBR+ 4. If the trap is a reset, it loads the PC 
with 0 and nPC with 4. 

The MB86901, like all SPARC processors, saves 
the volatile S field into the PSR itself and the re¬ 
maining fields are either altered in a reversible way 
(ET and CWP), or should not be altered in a trap 
handler until the PSR has been saved into memory. 
To restore the PSR S-bit and PC/nPC, a trap han¬ 
dler should include “JMPL” and “RETT” as the last 
two instructions. 

5.7. Interrupt Detection 

The MB86901 will latch the IRL <3:0> bus at the ris¬ 
ing edge of every clock cycle. If the output of this 
latch equals the current value of the IRL bus by the 
following cycle, an asynchronous trap may be in¬ 
itiated if the “ET” and “IPL” criteria are met. The best 
case for an interrupt response time is 3 cycles. This 
assumes the request arriving within the ID trap arbi¬ 
tration logic at a time when an instruction has just 
completed. The worst case is asserting an interrupt 
during the execution of a 4 cycle double store. That 
would add 3 cycles to the overhead, thus yielding a 
response time of 6 cycles. Neither of these esti¬ 
mates include any addit ional delay caused by asser¬ 
tion of MHOLD/FHOLD or interrupts being disabled. 

5.8. Trap Definitions 

The following discussion describes the 86901 traps 
and the conditions which cause them. 

instruction access exception_ 

This trap occurs when the MEXC signal line has 
been strobed during an instruction fetch. It may be 
used to indicate error conditions such as page faults 
and access violation. 

illegal Instruction 
This trap occurs when: 

(1) the UNIMP instruction is encountered, or 

(2) an unimplemented instruction which is not an 
FPop is encountered, or 

(3) when an instruction is fetched which, if ex¬ 
ecuted, would result in an illegal processor state 
(e.g. writing an illegal CWP into the PSR). 

Unimplemented FPop instructions generate fp ex¬ 
ception traps. 


privileged instruction 

This trap occurs when a privileged instruction is en¬ 
countered while the processor is in User mode (S = 
0 in PSR). 

fp disabled 

This trap occurs when a Floating Point Operate, 
Floating Point Branch, or Floating Point Load or 
Store instruction is encountered while the ^bit in 
the PSR is equal to 0, or no FPC is present (FP = 1). 

window overflow 

This trap occurs when a SAVE instruction would, if 
executed, cause the CWP to point to a window 
marked invalid in the WIM. 

window underflow 

This trap occurs when a RESTORE instruction 
would, if executed, cause the CWP to point to a win¬ 
dow marked invalid in the WIM. 

mem address not aligned 

This trap occurs when a load, store or JMPL instruc¬ 
tion would, if executed, generate a memory address 
or a new PC value that is not properly aligned. 

fp exception 

This trap occurs when the FPU is in exception pend¬ 
ing state and a floating point instruction (FPop, float¬ 
ing point load/store, FBbcc) is encountered. 

data access exception _ 

This trap occurs when the MEXC signal line has 
been strobed during an instruction fetch. It may be 
used to indicate error conditions such as page faults 
and access violation. 

tag overflow 

This trap occurs when a TADDccTV or a TSUBccTV 
instruction is executed which causes the overflow bit 
of the integer condition codes to be set. 

trap instruction 

This trap occurs when a taken Ticc instruction is ex¬ 
ecuted. 

interrupt level<3:0> 

A trap of this category occurs during assertion of a 
vector on the IRL<3;0> bus. The lower four bits of tt 
are equivalent to the vector asserted on the IRL 
bus. 
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Table 7.1 Bus Signal Descriptions 

Signal 

Type 

Description 

CLK1 

1 

CLOCK 1. This is typically a 75/25% duty cycle clock which is high during the 
first three quarters of each processor cycle, and low during the last quarter. 

CLK2 

1 

CLOCK 2. This is typically a 25/75% duty cycle clock which is low during the 
first quarter of each processor cycle, and high during the last three quarters. 

ADR<31:0> 

0 

ADDRESS BUS. This is the address bus which is used during all memory trans¬ 
actions. It is unlatched and therefore only valid at the very beginning of a bus 
cycle. It is tristated while ADROE is high. 


ASI<7:0> O ADDRESS SPACE IDENTIFIER. The Address Space Identifier identifies 

accessed address space as follows: 


ASI<7:0> 

Address Space 

0-7 

Implementation Definable 

8 

User Instruction Space 

9 

Supervisor Instruction Space 

10 

User Data Space 

11 

Supervisor Data Space 

12-255 

Implementation Definable 


ASI<7:0> are forced to a high impedance while AOE is high. 


D<31:0> I/O DATA BUS. This is a 32-bit data bus which is used for in struction and data trans¬ 

fer. D<31:0> are forced to a high impedance while DOE is high. 


HAL O HOLD ADDRESS LATCH. T he HA L signal is used to inhibit latching by the 

ADR<31:0> external latches. HAL is asserted during instruction pipeline inter¬ 
lock activity, during execution of some multiple cycle instructions, and during 
assertion of an MHOLD signal. 


WE O WRITE ENABLE. The WE signal is asserted during Store operations and during 

the store phase of Atomic Load/Store operations to indicate that data to b e 
stored is present on the Data Bus. A ss ertion o f WE is inhibited while FHOLD is 
active, and is terminated when FEXC or MEXC becomes active. 


RD O READ. The RD signal remains asserted during Load operations to indicate that 

the processor will read data, and is released throughout Store and Atomic 
Load/Store operations to i ndicat e that the processor will write data. RD is forced 
to a high impedance while AOE is high. 
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Table 7.1 Bus Signal Descriptions (cont.) 

Signal 

Type 

Description 

□FETCH 

0 

DATA FETCH. The DFETCH signal is asserted for one cycle at the beginning of 
data transfer operations to indicate that data will be transferred. DFETCH re¬ 
mains inactive during instruction fetches and during Floating Point Operate in¬ 
struction transfers. Assertion of NULL_CYC forces release of DFETCH. 

SIZE<1:0> 

0 

DATA SIZE. These signals identify transferred data as follows: 

SiZE<1:0> Data Size 

0 Byte 

1 Halfword 

2 Word 

3 Double Word* 

* (LDD, LDDF, STD, STDF double-word instructions) 

The Data Size signals are forced to a high impedance while AOE is high. 

LOCK 

0 

BUS LOCK. The LOCK signal is asserted by the processor during Store, Load 
Double, Store Double, and Atomic Load/Store operations to indicate to other bus 
masters that the System Interface is busy with multicycle operations. Bus 
masters must not try to gain control of the system bus during the cycle following 
assertion of LOCK. LOCK is forced to high impedance while AOE is high. 

MDS 

I 

MEMORY DATA STROBE. The MDS Strobe is asserted for one or more cycles 
while an MHOLD<A-C> signal is asserted during load and instruction fetch 
operations to indicate that valid data will be asserted on the Data Bus. The data 
must be present on the Data Bus one at least one cycle before MDS is released. 

ADROE 

I 

ADDRESS OUTPUT ENABLE. This active low signal controls the output drivers 
of the ADR<31:0> bus. The drivers are enabled when ADROE is low. and dis- 
abled to a high impedance when ADROE is high. AOE should be asserted while 
the BHOLD signal is active to allow another bus master to gain control of the af¬ 
fected bus signal lines. 

AOE 

I 

ALTERNATE OUTPUT ENABLE. This active low signal controls the output 
drivers of the ASI<7:0>, RD, SIZE<1:0>, and LOST signals.The drivers are 
enabled when AOE is low, and disabled to a high impedance when AOE is high. 
AOE should be asserted while the BHOLD signal is active to allow another bus 
master to gain control of the effected bus signal lines. 

DOE 

I 

DATA OUTPUT ENABLE. This active low signal controls the output drivers of 
the D<31:0> data bus. The drivers are enabled when DOE is low, and disabled 
to a high impedance when DOE is high. DOE should be asserted while a 
Memory Hold signal is active to allow another bus master to gain control of the 
Data Bus. 
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Table 7.1 Bus Signal Descriptions (cont.) 

Signal 

Type 

Description 


MHOLD<A-C> I MEMORY HOLD. Assertion of one or more of the four Memory Hold signals 

SHOLD freezes the processor instruction pipeline, and holds the processor bus signals 

in a wait state. 


A Memory Hold signal can be asserted to introduce wait states in order to ac¬ 
commodate slow memory or I/O response, and cache misses. All of the Memory 
Hold signals are OR’d by processor logic, and function identically. Four Memory 
Hold inputs are provided to allow flexibility in system design. The Memory Hold 
signals should be latched externally, and should be valid prior to CLK1 falling 
edges (see timing diagrams). 


BHOLD I Bus Hold freezes the p rocessor pipeline and allows another bus master to gain 

control of the data bus. BHOLD is active low. It should be latched externally and 
be valid prior to the CLK1 falling edge. 


IRL<3:0> I INTERRUPT REQUEST LEVEL. External system logic, typically a n interrupt 

controller, reports and identifies system interrupts via the IRL<3;0> bus as fol¬ 
lows: 


IRL<3:0> interrupt Request 

0 None 

11-4 Maskable 

15 Non Maskable 

Maskable Interrupts are recognized and serviced only if the ET bit is set in the 
PSR, and the IRL<3:0> value is greater than the PSR PIL<11:8> value. The 
Nonmaskable Interrupt is always recognized while the ET bit is set. 

External logic must prioritize interrupts in cases of simultaneous interrupt re¬ 
quests, and must latch and assert the IRL value corresponding to the highest 
pending interrupt until the interrupt request is cleared by software. 


IRL<3:0> must be asserted synchronously with respect to CLK1. 


RESET I RESET. Assertion of RESET initializes the processor as follows: 

(1) Supervisor mode is selected (S = 1 in PSR). 

(2) The PC is set to 0. 

(3) The N PC is set to 4. 

Othe r registe r fields are either undefined (power on), or retain their state at the 
time RESET is asserted. The processor fetches the fi rst instru ction from address 
0 at the first rising edge of CLK1 following release of RESET . 
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Table 7.1 Bus Signal Descriptions (cont.) 

Signal 

Type 

Description 

TC 

I 

TC determines the behavior of the IFLUSH instruction. When TC is high, 
IFLUSH executes like a NOP with no side effects. When TC is low, IFLUSH 
causes an unimplemented instruction trap. 

MEXC 

I 

MEMORY EXCEPTION. The memory or cache controller asserts MEXC to re- 
port a memory error. The processor recognizes MEXC during the cycle following 
its assertion. 

ERROR 

0 

ERROR. The processor asserts ERROR to report to the system that it has 
halted in the error state as a result of a synchronous trap it has encountered 
while ET = 0 in the PSR. RESET must be asserted for recovery from the error- 
state. 

NULL_CYC 

o 

NULLIFY CURRENT CYCLE. The processor asserts this signal to indicate that 


the address currently latched in the external memory latches is not a valid 
address. This occurs when the processor fetches an instruction (such as the tar¬ 
get of an untake n branch ) whi ch it doesn’t us e. The processor also asserts 
NULL_CYC when FHOLD or an MHOLD<A-C> signal is asserted. 

NULL_CYC should be used by cache logic to eliminate false cache misses due 
to invalid addresses. Assertion of NULL CYC forces release of DFETCH. 


IN_NULL I INHIBIT NULL CYCLE ENABLE. IN_NULL is an asynchronous in put which sys¬ 

tem logic can use to control the assertion of NULL_C YC. When IN_NULL = 1, 
the processor can assert NULL_CYC: when IN_NULL = 0, NULL_CYC is forced 
low. 


LOST O ATOMIC LOAD/STORE. This signal is asserted to indicate that an atomic Load 

or St ore operation is in progress. LOST is forced to a high impedance while 
AOE is high. 


F<31:0> O FLOATING POINT INSTRUCTION/ADDRESS BUS. The processor transfers 

Floating Point Operate instructions and their addresses to the floating point unit 
via this dedicated 32-bit floating point bus. The processor first transfers an in¬ 
struction, then transfers the instruction address the following cycle (see 
MB86911 data sheet). 


FEXC I FLOATING POINT E XCEPT ION. The floating point unit asserts FEXC to report 

a floating point error. FEXC must remain asserted until the processor recognizes 
the trap and asserts FXACK. 
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Table 7.1 Bus Signal Descriptions (com.) 

Signal 

Type 

Description 

FHOLD 

I 

FLOATING POINT HOLD. Assertion of FHOLD halts the processor. The floating 
point unit asserts FHOLD when it detects data dependencies which require that 
the processor be halted. The processor resumes execution when the floating 
point unit releases FHOLD. 

FCC<1:0> 

I 

FLOATING POINT CONDITION CODE. These are floating point condition code 
signals which are asserted by the floating point unit to indicate results of Float¬ 
ing Point Operate instruction executions as follows: 

FCC<1:0> Result Description 

0 operand 1 = operand 2 

1 operand 1 < operand 2 

2 operand 1 > operand 2 

3 unordered relationship 

FCCV 

I 

FLOATING POINT CONDITION CODE VALID. The floating point unit asserts 
FCCV to indicate that the floating point condition code asserted on FCC<1:0> is 
valid. 

FP 

I 

FLOATING POINT UNIT PRESENT. This input is active low when a floating 
point coprocessor is present on the floating point interface bus. 

The processor checks this signal when it encounters a Floating Point Operate 
(FPop) or a floating point Load or Store instruction. The lU acts as follows when 
such an instruction is encountered. When FP is asserted, the lU will dispatch 
the instruction to the FPC. depending on the state of the EF bit in the PSR (see 
Section 2.5.2). When FP is negated, the lU will always initiate an FP disabled 
trap, after which, a floating point library routine can emulate the FPop. 

FXACK 

0 

FLOATING POINT EXCEPTION ACKNOWLEDGE. The processor asserts 
FXACK in response to a Floating Point Exception trap ( FEXC ) to acknowledge 
that it has recognized the trap. The floating point unit should release FEXC 
when FXACK is asserted. 

FLUSH 

0 

FLUSH. The FLUSH signal, when asserted, forces the floating point unit to abort 
(flush) the instruction in the current floating point unit write cycle. FLUSH does 
not affect instructions in the floating point unit instruction queue. 

FEND 

0 

FLOATING POINT END. The processor asserts this signal to indicate to the 
floating point unit that the last cycle of the current processor Floating Point 
Operate instruction transfer is in progress. This allows the floating point unit to 
synchronize its operation with the processor. 


39 









Illlllllllllilllllllllllllllllllllliilllllllil MB86901 

Table 7.1 Bus Signal Descriptions (cont.) 

Signal Type Description 

FADR O FLOATING POINT ADDRESS. The processor asserts this signal to indicate that 

a valid address is present on the F<31 ;0> bus. The floating point unit should use 
FADR to latch floating point addresses. 


FINS O 



FLOATING POINT INSTRUCTION. The processor asserts FINS to indicate that 
a vaiid instruction is present on the F<31:0> bus.The floating point unit should 
use FINS to latch floating point instructions. 

SCAN CLOCKS. The scan clocks are used only during processor scan testing, 
are intended for factory use, and must be high during normal operation. 


XSM, i SCAN CLOCKS ENABLE. The XSM and XTEST signals are asserted only 

XTEST I during processor testing to disabie the system clocks and enable the Scan 

Clocks, is intended for factory use, and must be high during normal operation. 


SDI I SCAN DATA iNPUT. This is the seriai data pattern input signai which is used 

only during processor scan testing. It is intended for factory use, and must be 
high during normai operation. 

SDO O SCAN DATA OUTPUT. This is the serial data pattern output signal which is used 

only during processor scan testing. It is intended for factory use, and may be left 
unconnected. 


7.1. System interface 


The AOE, DOE, and IN_NULL input signals are 
asynchronous, disabling output drivers or inhibiting 
the NULL_CYC signal immediately when released, 
without regard to clocks. This ailows fast tran sfer of 
essentiai bus signals to other bus masters (AOE, 
DOE), an d fast nullification of system addresses 
(iN_NULL) without the “wait” for clock edges that 
would otherwise be required. 

Ali other processor signais are synchronous, allow¬ 
ing the fast addressing and dat a transfer poss ibie 
only in a synchronous system. MHOLD<A-C> are 
referenced to the faliing edge of CLK1; all other sig¬ 
nals are referenced to the rising edge of CLK1. 

The MB86901 interface signals are shown in Figure 
7.1, Processor Signais, and are fuily described in 
Table 7.1, Bus Signal Descriptions. The following 
sections contain brief descriptions of the signais. 


The System interface is comprised of signals and 
buses which interface to system logic other than the 
floating point unit. These include the Address Bus, 
the Data Bus, the Bus Cycle Definition signals, the 
Bus Control signals, the Interrupt Request Level 
bus, and the RESET and ERROR Special signals 
(see Figure 7.1). 

The System Interface protocol supports multiple bus 
masters, allows wait states to accommodate slow 
memory and I/O, and facilitates cache implementa¬ 
tion. 

The separate, non-muitiplexed Address and Data 
buses are designed for single-cycle operation 
without the design complexities inherent in the multi¬ 
plexed address/data bus used in other processors. 
The Interrupt Level Bus allows fast response to sys¬ 
tem interrupts, and increases the effective band¬ 
width of the Data Bus by allowing it to be used for 
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instruction and data transfer only. The extensive 
offering of Bus Cycle Definition and Bus Control sig¬ 
nals in the System Interface allows a high degree of 
design flexibility without performance compromise. 

These and other System Interface features combine 
with the MB86901 RISC architecture to give the pro¬ 
cessor its high throughput. 

Address Bus 

The Address Bus includes all signals necessary to 
locate instructions, data, and I/O. These include 
ASI<7:0> which select address spaces and are 
used by memory management protection logic 
which may be implemented, and the ADR<31:0> 
address bus signals which identify particular loca¬ 
tions in selected address spaces. 

The asi signals allow selection of up to 256 address 
spaces, with each address space containing as 
many as 4 gigabytes, for a maximum combined ac¬ 
cessible space of 1024 gigabytes. 

Four of the address spaces are dedicated according 
to mode (user or supervisor) and type of access (in¬ 
struction or data). 252 of the address spaces are im¬ 
plementation definable and are selected by the “asi” 
field of the Load From Alternate Space, Store Into 
Alternate Space, and Atomic Load-Store Into Alter¬ 
nate Space instructions. 

The address signals select individual words in 
accessed address spaces. 

Data Bus 

The Data Bus transfers all instructions and data be¬ 
tween the processor, and memory and I/O. It is a 
non-multiplexed, 32-bit bus capable of transferring 
one word each cycle. 

Bus Cycle Definition Signais 

The Bus Cycle Definition signals are asserted 
during bus operations to identify to the system the 
types of operations in progress as follows: 

• RD identifies operations as loads (reads) or 
stores (writes). 


• LDST identifies Atomic Load-Store operations. 

• LOCK identifies the multi-cycle Load Double and 
Store Double operations. 

• DFETCH identifies data transfer operations. 

• SIZE<1:0> identifies transferred data as bytes, 
halfwords, or words. 

Bus Control Signals 

The Bus Control signals directly control the state of 
the System Interface signals, directly control system 
logic associated with the System Interface, and in¬ 
directly control bus operations by supporting bus 
protocol as follows: 

• WE indicates that data to be stored is present on 
the Data Bus. 


• MDS indicates that data to be loaded is present 
on the Data Bus. 


• HAL inhibits latching by the address latches. 


• MHOLD<A-C> hold the processor in a wait state. 

• NULL_CYC indicates that the address in the ex¬ 
ternal address l atches is not a new valid 
address, or tha t FHOLD or one of the 
MHOLD<A-C> signals is asserted. 


• IN_NULL gates assertion of NULL_CYC. 

• TC indicates the presence of a cache. 


• MEXC indicates a memory error. 


• AOE enables the ASI<7:0>, RD, SIZE<1:0>. 
LOCK, and LDST output drivers. 


• DOE enables the D<31:0> output drivers. 

Interrupt Request Level Bus 

The IRL<3:0> Interrupt Request Level bus is used 
by system logic to report both maskable and non¬ 
maskable interrupt service requests, and their level 
(see Section 5.1.1). 
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Special Signals 


RESET and ERROR are special signals which are 
not normally asserted during operation. 


RESET is asserted by system logic to initialize the 
processor, then is released to force boo t progra m 
execution at location 0 (see Section 6). RESET is 
always asserted at power-on, but is usually asserted 
during operation only in response to a catastrophic 
system failure. 


ERROR is asserted by the processor to indicate that 
it has halted in the error state in resp onse to a trap 
occurrence while ET = 0 in the PSR. RESET must 
be asserted for recovery from the error state. 

7.1.1, Basic Timing 

Basic processor timing is shown in Figure 7.2. The 
figure shows fetching of instruction 11, followed by 
fetches of sequential instructions I2 and I3, followed 
by instruction 11 data transfer (such as a data read 
for a Load instruction), followed by instruction I4 
fetch. 

This timing is typical, showing instruction fetches or 
data transfer each cycle, with no delays (wait cy¬ 
cles) due to cache misses or memory exceptions. 

The notation with format “SJ” above each clock 
cycle identifies the instruction pipeline stage in 
which a particular instruction is being processed (S), 
and the instruction or instruction operation (I) as fol¬ 
lows: 


Pipeline Stage 

F = Fetch Stage 
D = Decode Stage 
E = Execute Stage 
W = Write Stage 
WH = Write Stage Hold 


Instruction 

IX = Instruction X 
LD = Load Operation 
ST = Store Operation 
T = Trap Dispatch 

Table instruction 


Note that instruction II is decoded and executed in 
the instruction pipeline while instructions which fol¬ 
low are fetched. 


All instructions are pipelined in this manner. 


7.1.2. Basic Data Transfer Timing 

Figures 7.3-7.7 show timing for the Load, Store, 
Load Double Word, Store Double Word, and Atomic 
Load-Store Operations. These operations are iden¬ 
tified by the assertion of DFETCH and ASI<7:0> at 
the beginning of each operation to indicate that 
data, rather than an instruction, is to be transferred 
via the D<31:0> data bus. 

The Load operation completes in one cycle as 
shown in Figure 7.3. 

RD and WE remain released during the operation to 
indicate that the processor is to read data. 

The Store Operation, shown in Figure 7.4, 
completes in two cycles. The processor releases RD 
during the fir st cy cle to indicate that it will write data, 
then asserts WE during the second cycle to indicate 
that the data to be written is asserted on the data 
bus. The processor also asserts LOCK to ind icate 
that a multi-cycle operation is in progress, and HAL 
and NULL_CYC to disable the External Instruction 
Pipeline and to indicate that the second cycle is 
“nulled,” since it is not a new operation. 

The Load Double Word operation (Figure 7.5) is es¬ 
sentially two back-to-back load operations. The pro¬ 
cessor asserts LOCK during the second cycle to 
indicate that the operation is multi-cycle. 

The Store Double Word operation (Figure 7.6) is es¬ 
sentially two back-to-back store operations. The 
second store completes in one cycle rather than 
two, however, so that the entire o perat ion completes 
in three cycles. Once again HAL, LOCK, and 
NULL_CYC are asserted as in the Store case, with 
LOCK held asserted for two cycles to indicate a 
three-cycle operation. 

The Atomic Load-Store operation (Figure 7.7) is es¬ 
sentially nondivisible—a load followed by a store. 
The processor asserts LDST to identify the opera¬ 
tion as an Atomic Load-Store. 
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Figure 7.2 Basic Timing (Cache Hit) 
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Figure 7.4 Store (Cache Hit) 
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7.1.3. Cache Miss Timing 

Figures 7.8-7.12 show timing which resuits from 
cache misses. 

The processor first tries to access instructions/data 
in the system cache, if a cache is implemented. If an 
accessed instruction/data is in the cache, a cache 
hit occurs, and the operation proceeds with no 
delays as shown in Figures 7.2-7.7. 

If an accessed instruction/data is not in the cache, a 
cache miss occurs, and the operation proceeds with 
delays as shown in Figures 7.8-7.12. 

System logi c introdu ces delay during an operation 
by asserting MHOLD as shown in the figures, which 
holds the processor in a pause state until released. 


Note that when MHOLD is asserted, the processor 
has advanced to the next instruction. External 
latches must therefore capture the bus signal states 
which correspond to the “miss” instruction (see Ex¬ 
ternal Address Pipeline, Section 8.2) for the re¬ 
quired memory access. 


The system logic strobes MDS during a “held” in¬ 
struction fetch or load operation to indicate to the 
processor that the instructi on or da ta is present on 
the data bus, then releases MHOLD to complete the 
operation . During a store operation, the logic simply 
releases MHOLD to indicate that the destination 
device (memory or I/O) has latched the data. 

Other signal timing is similar to the cache hit cases. 

7.1.4. Memory Exception Timing 

Memory exception timing for various bus operations 
is shown in Figures 7.13-7.16. 


The ope rations b egin normally. System logic then 
asserts MHOLD to hold the processor, asserts 
MEXC to re port the me mory ex ception, then re¬ 
leases both MHOLD and MEXC as shown in the 
figures to allow the processor to begin exception 
servicing. T1 and T2 are Trap Dispatch Table in¬ 
structions which vector the processor to the appro¬ 
priate trap handler. 

7.1.5. Special Timing 

Bus Request, Error/Reset, and asynchronous trap 
(interrupt) timing are shown in Figures 7.17-7.19, 
respectively. 

A bus arbiter or bus master ga ins control of the sys¬ 
tem bus by asserting BHOLD t o hold the proc essor 
in a wait state, and releasing AOE, ADROE, and 
DOE to force essential processor outputs to a high 
impedance as shown in Figure 7.17. The new bus 
master drives the three-stated lines while the pro¬ 
cessor output enable signals are released. 


The bus arbiter releases BHOLD and the processor 
output enable signals once the bus master has 
finished operations, thereby allowing the processor 
to resume execution. 


ERROR and RESET t iming are shown in Figure 

7.18. Note that RESET must be held asserted for a 
minimum of ten processor cycles to assure 
complete processor initialization. 

Asynchronous interrupt timing is shown in Figure 

7.19. IRL<3:0> = 0 indicates no interrupt. Any other 
IRL<3:0> state indicates an interrupt which may be 
recognized according to the value of the Processor 
Interrupt Level in the PSR (see Section 2.5.2). 

IRL<3:0> must be held asserted until cleared by 
software. 
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Figure 7.17 Bus Request 
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7.2. Floating Point interface 

The Floating Point Interface directly interfaces the 
processor to the Fujitsu MB86911 Floating Point 
Controller, which in turn directly interfaces to the Tl 
SN74ACT8847 Floating Point Processor (FPP). This 
allows use of the MB86911 and the FPP as a Float¬ 
ing Point Unit (FPU) for the MB86901 which ex¬ 
ecutes floating point operations concurrently with 
processor integer operations, as explained in Sec¬ 
tion 4. 

Data transfer between the processor and FPU is via 
the D<31:0> System Interface data bus. All other in¬ 
formation transfer between the processor and FPU 
is via Floating Point Interface buses and signals as 
follows: 

• F<31:0> transfers Floating Point Operate instruc¬ 
tions and their addresses to the FPU. 

• FADR indicates a valid address on the F<31 ;0> 
bus. 

• FINS indicates a valid instruction on the F<31:0> 
bus. 

• FEND indicates the last processor cycle of the 
current instruction transfer. 


• FCC<1:0> transfers floating point operation con¬ 
dition codes to the processor. 

• FCCV indic ates a valid condition code on the 
FCC<1:0> bus. 


• FEXC indicates a floating point exception. 

• FXACK indicates processor recognition of a float¬ 
ing point exception. 


• FHOLD halts the processor. 

• FLUSH forces the unit to abort the current in¬ 
struction. 

• FP indicates the presence of an FPU. 


7.2.1. FPOP instruction Transfer Timing 

Timing for Floating Point Operate (FPOP) instruction 
transfers are shown in Figures 7.20 and 7.21. 

Figure 7.20 shows FPOP instruction transfer without 
delays. The FPOP instruction is transferred during 
the first cycle, followed by transfer of the FPOP in¬ 
struction address the second cycle. The instruction 
address is latched by the FPU to support recovery 
from faults which may occur (see FPU data sheets). 

The processor asserts FINS during the first cycle to 
indicate to the FPU that it is an instruction cycle, 
asserts FADR during the second cycle to indicate 
that it is an address cycle, and asserts FEND after 
the second cycle to indicate that the operation is 
completed. 

Figure 7.21 shows FPOP instruction transfe r timing 
with delay cycles. The FPU has asserted FHOLD 
during the operation as shown in the figure, h olding 
the processor in a wait state until FHOLD is re¬ 
leased. 

7.2.2. FP Load And Store Timing 

Timing for the floating point Load, Store, Load 
Double Word, and Store Double Word instructions 
are shown in Figures 7.22-7.25. These instructions, 
containing memory source or destination operand 
addresses, are transferred to the FPU for execution. 

The processor asserts DFETCH at the beginning of 
each operation to indicate that data will be trans¬ 
ferred, transfers the instruction during the first cycle 
of the operation, then transfers the instruction 
address during the second cycle. 

The FPU transfers data to/from memory via the 
D<31 ;0> bus as shown in the figures. 

7.2.3. Cache Miss and Exception Timing 

Cache miss timing, memory exception timing, float¬ 
ing point exception timing, and various combinations 
of each are shown in Figures 7.26-7.34. This timing 
is similar to that of non-floating point operations 
(see Sections 7.1.3 and 7.1.4), with floating point 
exceptions resulting in the same timing as memory 
exceptions. 
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Figure 7.24 FP Load Double (Cache Hit) 
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Figure 7.30 FP Load Exceptions 
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Figure 7.34 FPOP Floating Point Exception 
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7.3. System Configuration 

Figure 7.35 shows a basic MB86901 system con¬ 
figuration utiiizing an FPU, an interrupt controiier, 
and a bus arbiter. The diagram is for illustration pur¬ 
poses only, and may require modifications such as a 
separate cache bus, for instance, to minimize sys¬ 
tem bus loading. 

The FPU consists of the Fujitsu MB68911 RISC 
Floating Point Controller and the Tl SN74ACT8847 
floating point chip, interfaced to the processor via 
the processor Floating Point Interface and the Sys¬ 
tem Interface D<31:0> bus. 

Data is transferred between the processor and the 
FPU via the processor System Interface D<31:0> 
bus. All other signals are transferred via the Floating 
Point Interface as shown. 

The Interrupt Controller receives interrupt requests 
from system peripheral devices, prioritizes or other¬ 
wise arbitrates multiple interrupt requests, then 
asserts the selected interrupt request level number 


on the IRL<3:0> bus to alert the processor of the 
pending interrupt. The controller holds the level 
number asserted until the processor acknowledges 
recognition of the request via software, typically by 
accessing a location that the Interrupt Acknowledge 
Decode Logic recognizes as an interrupt acknow¬ 
ledge. 

The Bus Arbiter receives bus requests from bus 
masters for control of the system bus, halts the pro¬ 
cessor and three-states its bus drivers in response 
to a bus request, then grants the system bus to the 
requesting bus master. The requesting bus master 
holds its bus request asserted until finished with the 
bus. 

The Bus Arbiter halts the processor by asserting 
MHOLD. It three-states all of the processor System 
Inter face output drivers excep t the DFETCH and 
HAL drivers by releasing the AOE, ADROE, and 
DOE driver enable signals. The arbiter asserts the 
driver enable signals once again when the current 
bus master releases its bus request, allowing the 
processor to continue execution. 


Figure 7.35 Basic System Configuration 
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Note that ASI<7:0>, ADR<31:0>, and several Sys¬ 
tem Interface control and cycle-definitio n sig nals are 
externally latched under control of the HAL proces¬ 
sor signal. These latches, part of the External 
Address Pipeline, are explained in Section 8.2. 

8. System Design Considerations 

System design with the MB86901 is similar to de¬ 
sign with other processors. Two areas of system de¬ 
sign, however, require special consideration: the 
clock generation circuit, and the External Address 
Pipeline. 

Examples of each are described in the following 
sections, although other logic which adheres to the 
processor timing specifications may be used. 

8.1. Clock Generator 

To generate CLK1 and CLK2 accurately, a 4x base 
clock frequency, operating at 100 MHz (G25), is re¬ 
quired. With today's high frequency crystal oscilla¬ 
tors, and “AS” type TTL flip fiops going beyond 100 
MHz, this poses no problem. 


Figure 8.1 illustrates an example of such a clock 
generator based on the frequency division principle. 
The 2 units, U2a and U2b are initially set to op¬ 
posite states. They will maintain this interreiation- 
ship indefinitely. The two gates U3a and U3b are 
used to derive the 75/25% and 25/75% cycie for 
CLK1 and CLK2 respectively. 

To minimize any skew between CLK1 & CLK2, U3a 
and U3b must be part of a 74AS1000 unit, while 
U2a and U2b must occupy one 74AS109 unit. 

8.2. External Address Pipeline 

Figure 8.2 shows an External Address Pipeline and 
associated logic. 

The ADR<31:0> Address Bus is unlatched, and 
therefore not valid for the duration of a bus cycle. An 
external set of registered latches (U2) is therefore 
required to maintain the address bus state 
throughout a bus cycle, and drive any cache or 
memory subsystem. 


Figure 8.1 Clock Circuit 
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During a cache miss, the processor is typically held 
in the cycle following the one that actualiy caused 
the miss. A second set of registered latches (U3) is 
therefore required to generate the previous address 
and associated cycle parameters. 

Cloc king of the first level registers is done b y gat ing 
HAL with CLK1. The processor controls HAL to 
maintain address integrity on the first set of latches. 
To utilize the address setup time, and not exceed 
the hold time, CLK1 shouid also be delayed by one 
gate level before entering the processo r. Ide ally the 
gates used to invert CLK1, and gate HAL, would 
reside in one package to minimize skew. The first 
level registers are only tri-stated during miss pro¬ 
cessing. 


The second level register s must be clocked by gat¬ 
ing CLK1 with MHOLD to guarantee a stable 
address during miss processing. They are normally 
tri-stated as they only are used during miss pro¬ 
cessing. 

9. Processor Specifications 

Figures 9.1-9.10 show MB86901 timing specifica¬ 
tions, test load, and output delay. AC timing parame¬ 
ters are listed in Table 9.4. 

All timing parameters are referenced to the 1.5V 
midpoint of CLK1 rising edges. 


Figure 8.2 Externai Address Pipeiine 



MHOLD C-CYCLE P-CYCLE 

(Current Cycle) (Previous Cycle) 
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Table 9.1 Absolute Maximum Ratings 


Rating 

Symbol 

Min. 

Max. 

Unit 

Supply Voltage 

^DD 


6.0 

V 

Input Voltage 

''t 

Vss-O-5 

0.5 

V 

Output Voltage 


Vss-» = 


V 

Temperature Under Bias 

^bias 

-40 

-25 

125 

85 

•c 

T, . Ceramic 

Storage Temperature Plastic 

stg 

-65 

-40 

150 

125 

• c 


Note: (*) Permanent device damage may occur if Absolute Maximum Ratings 

are exceeded. Functional operation should be restricted to the conditions 
as detailed in the operational sections of the data sheet. Exposure to 
absolute maximum rating conditions for extended periods may affect 
device reliability. 


Table 9.2 Recommended Operating Conditions 


Parameter 

Symbol 

Condition 

Min 

Typ 

Max 

Units 

Supply Voltage 

^DD 


4.75 

5.0 

5.25 

V 

Power Supply Current 

I DDS 

V,H = , V„ = Vs3 

0 


0.2 

mA 

Output High Voltage 

^OH 

Iqh = -2 mA 

4.0 


^DD 

V 

Output Low Voltage 

VoL 

I = 3.2 mA 



0.4 

V 

Input High Voltage 

V,H 


2.2 


— 

V 

Input Low Voltage 

V,i. 


— 


0.8 

V 

Input Leakage Current 

I IN 

ViN = 0 to Vj)j) 

-10 


10 

uA 

Tri-state Leakage Current 

I IN 

ViN = 0 to Vj)j) 

-40 


40 

uA 

Operating Temperature 

_L_ 


0 


70 

•c 


Table 9.3 Capacitance & Termination 


( T = 25 C , = 0 Volt , Frequency = 1 MHz) 


Parameter 

Symbol 

Min. 

Typ. 

Max. 

Unit 

Input Pin Capacitance 

C,N 



16 

pF 

Output Pin Capacitance 

^OUT 



16 

pF 

I/O Pin Capacitance 

^I/O 



16 

pF 

Input Pull-Up Resistor Vjh= 

ViL = 

RP 

25 


100 

KIl 
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Figure 9.2 Clock AC Measurement Points 
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MB86901 Clock requirement 


Symbol 

Parameter 

20 MHz 

25 MHz 

Units 

Test 

Conditions 

Min 

Max 

Min 

Max 

1 

System Clock Cycle Time (T) 


50 


40 

ns 


2 

System Clock Rise Time 


3 


2 

ns 


3 

System Clock Fall Time 


3 


2 

ns 


4 

System Clocks (Clkl, Clk2) Skew Time 


+/- 4 


+/- 3 

ns 

Both CLKl &CLK2 
High Time = 37.5nS 
Low Time = 12.5 nS 

5 

System Clock (Clkl) High Time 

35 


28 


ns 


6 

System Clock (Clkl) Low Time 

10 


8 


ns 


7 

System Clock (Clk2) High Time 

20 


16 


ns 


8 

System Clock (Clk2) Low Time 

11 


9 


ns 



Figure 9.3 Clocks Timing Diagram 
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Table 9.4 AC Characteristics (cent.) 


5V i 5% = 0 to +70 *C Capacitance = 50 pF 


MB86901 Address & Data Bus 


Symbol 

Parameter 

20 MHz 

25 MHz 

Units 

Test 

Conditions 

Min 

Max 

Min 

Max 

9 

ADR Valid Delay, From CLKl Rising Edge 


46 


37 

ns 


10 

ADR Hold, From CLKl Rising Edge 

7 


6 


ns 


11 

ASI Valid Delay, From CLKl Rising Edge 


29 


23 

ns 


12 

ASI Hold, From CLKl Rising Edge 

7 


5 


ns 


13 

Read Data Setup, Before CLKl Rising Edge 

4 


4 


ns 


14 

Read Data Hold, After CLKl Rising Edge 

5 


4 


ns 


15 

Write Data Valid Delay 

From CLKl Rising Edge 


33 


26 

ns 


16 

Write Data Hold After CLKl Rising Edge 

6 


5 


ns 


17 

Write Data Turn Off From CLKl 


31 


26 

ns 


17a 

Write Data Turn On From CLKl 

11 

27 

9 

23 




Figure 9.4 Address and Data Bus 
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Table 9.4 AC Characteristics (cent.) 

V^= 5V i 5% T» = 0 to +70 *C Capacitance = 50 pF 

CC /V 


MB86901 Address, Data & Control Tri-State 


Symbol 

Parameter 

20 MHz 

25 MHz 

Units 

Test 

Conditions 

Min 

Max 

Min 

Max 

18 

/AOE Turn Off Time 

6 

29 

5 

23 

ns 


19 

/AOE Turn On Time 

6 

19 

5 

15 

ns 


20 

/ADROE Turn Off Time 

6 

29 

5 

23 

ns 


21 

/ADROE Turn On Time 

6 

19 

5 

15 

ns 


22 

/DOE Turn Off Time 

8 

33 

5 

26 

ns 


23 

/DOE Turn On Time 

8 

23 

5 

18 

ns 



Figure 9.5 Address, Data and Control Tri-state 
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Table 9.4 AC Characteristics (cent.) 

5V i 5% 1^= 0 to +70 *C Capacitance = 50 pF 


MB86901 Control Signals, Output 


Symbol 

Parameter 

20 MHz 

25 MHz 

Units 

Test 

Conditions 

Active 

Hold 

Active 

Hold 

24 

SIZE, From CLKl Rising Edge 

22 

6 

18 

5 

ns 

Max Rating 
for Active 

Min Rating 
for Hold 

25 

RD, From CLK! Rising Edge 

22 

6 

18 

5 

ns 

26 

/WE, From CLKl Rising Edge 

23 

6 

18 

5 

ns 

27 

LOST, From CLKl Rising Edge 

22 

6 

18 

5 

ns 

28 

29 

NULL^CYC, From CLKl Rising Edge 

From /MHOLD Falling Edge 

40 

24 

6 

32 

19 

5 

ns 

30 

31 

/HAL From CLKl RisiM Edge 

From /MHOLD Falling Edge 

37 

23 

6 

30 

18 

5 

ns 

32 

LOCK, From CLKl Rising Edge 

25 

6 

20 

5 

ns 

33 

DFETCH, From CLKl Rising Edge 

32 

6 

26 

5 

ns 

34 

/ERROR, From CLKl Rising Edge 

32 

6 

26 

5 

ns 


Figure 9.6 Control Signals, Output 
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Table 9.4 AC Characteristics (cent.) 

5V i 5% ® +70 t Capacitance = 50 pF 
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MB86901 Control Signals, Input 


Symbol 

Parameter 

20 MHz 

25 MHz 

Units 

Test 

Conditions 

Setup 

Hold 

Setup 

Hold 

35 

36 

/MDS, Before CLKl Falling Edge 

Before CLKl Rising Edge 

8 

25 

1 


1 

ns 

Min Rating 
for Setup 
and Hold 

37 

38 

/MEXC, Before CLKl Falling Edge 

Before CLKl Rising Edge 

8 

25 

1 

7 

20 

1 

ns 

ns 

39 

/MHOLDA, Before CLKl Falling Edge 

16 

0 

13 

0 

ns 

40 

/MHOLDB, Before CLKl Falling Edge 

16 

0 

13 

0 

ns 

41 

/MHOLDC, Before CLKl Falling Edge 

16 

0 

13 

0 

ns 

42 

/SHOLD, Before CLKl Falling Edge 

16 

0 

13 

0 

ns 

43 

/BHOLD, Before CLKl Falling Edge 

16 

0 

13 

0 

ns 

44 

IRL, Before CLKl Rising Edge 

11 

4 

9 

3 

ns 

45 

/RESET, Before CLKl Rising Edge 

Active Time ('cycles') 





ns 

T 


Figure 9.7 Control Signals, Input 
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Table 9.4 AC Characteristics (cent.) 

5V i 5% = 0 to +70 *C Capacitance = 50 pF 


MB86901 FP Bus, Output 


Symbol 

Parameter 

20 MHz 

25 MHz 

Units 

Test 

Conditions 



Active 

Hold 

Active 

Hold 


46 

F<31:0>, From CLKl Rising Edge 

41 

6 

33 

5 

ns 


47 

FINS, From CLKl Rising Edge 

32 

6 

26 

5 

ns 

Max Rating 

48 

FADR, From CLKl Rising Edge 

32 

6 

26 

5 

ns 

for Active 

49 

FEND, From CLKl Rising Edge 

32 

6 

26 

5 

ns 

Min Rating 
for Hold 

50 

FLUSH, From CLKl Rising Edge 

26 

6 

22 

5 

ns 


51 

FXACK, From CLKl Rising Edge 

32 

6 

26 

5 

ns 



Figure 9.8 FP Bus, Output 
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Figure 9.10 Signal Output Test Load 
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Figure 9.11 Maximum Output Delay vs. Capacitance Loading 
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Figure 9.12 179-Leacl Plastic Pin Grid Array Package 
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Figure 9.13 Pin Assignment 
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Figure 9.14 MB86901 Pin Out by Pin Number 


PIN 

I/O 

NAME 

PIN 

I/O 

NAME 

PIN 

I/O 

NAME 

1 

O 

ADR10 

61 

O 

ADR27 

121 

O 

ASI3 

2 

O 

ADR02 

62 

O 

ADR25 

122 

O 

ASI6 

3 

O 

ADR13 

63 

O 

ADR28 

123 

1 

RESET_ 

4 

O 

ADR24 

64 

- 

VSS 

124 

O 

DFETCH 

5 

O 

ADR30 

65 

O 

ADR23 

125 

1 

FEXC_ 

6 

1 

FCCV 

66 

1 

XTST 

126 

1 

AOE_ 

7 

O 

ADR20 

67 

o 

ADR11 

127 

I/O 

D03 

8 

O 

ADR21 

68 

o 

ADR16 

128 

I/O 

D30 

9 

O 

ADR19 

69 

o 

ADR08 

129 

- 

VSS 

10 

O 

ADR22 

70 

1 

TC_ 

130 

I/O 

D21 

11 

O 

ADR06 

71 

o 

RD 

131 

I/O 

D13 

12 

O 

FXACK 

72 

1 

MHOLDB_ 

132 

I/O 

D05 

13 

O 

ADR03 

73 

o 

SIZEO 

133 

I/O 

D24 

14 

- 

VSS 

74 

o 

ASI2 

134 

I/O 

D04 

15 

1 

IRL2 

75 

I/O 

D02 

135 

o 

F18 

16 

o 

ASM 

76 

1 

MEXC_ 

136 

o 

F26 

17 

o 

FLUSH 

77 

o 

ASI5 

137 

o 

F02 

18 

o 

WE_ 

78 

1 

SHOLD_ 

138 

o 

F27 

19 

1 

IRL3 

79 

o 

FINS 

139 

o 

F07 

20 

o 

HAL_ 

80 

1 

DOE_ 

140 

o 

F20 

21 

o 

LOCK 

81 

o 

NULL_CYC 

141 

o 

F12 

22 

o 

LOST 

82 

I/O 

D27 

142 

o 

F29 

23 

1 

SDI 

83 

o 

SDO 

143 

1 

XACK 

24 

o 

ASI4 

84 

I/O 

D26 

144 

- 

VSS 

25 

1 

MDS_ 

85 

o 

ERROR_ 

145 

o 

ADR01 

26 

1 

FHOLD 

86 
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MB86901 


ERRATA 


Page Correction 

22,23 On page 22, paragraph 2, line 2, and 
page 23, paragraph 3, line 2, replace 
“r[rs1] = r[rs2]” with “r[rs1] + r[rs2]”. 

34 In Figure 7.1, Processor Signals, on the 
Bus Con trol input line for MHOLD <A-C>, 
SHOLD, replace the value “5” with a “4". 

53 The correct title of Figure 7.11 is: Load 
Double (Cache Miss). The correct Store 
Double (Cache Miss) figure is shown 
below: 


54 In Figure 7.12, Atomic Load-Store 

(Cache Miss), replace “AL<17:0>” with 
“ADR<31:0>”. 

95 Under Ordering Information (opposite 

errata page), replace “MB86901-APR-G” 
with “MB86901ACR-G” and replace 
“MB86901-APR-G25” with 
“MB86901ACR-G25”. 


STORE DOUBLE WORD (CACHE MISS) 
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FUJITSU MICROELECTRONICS, INC. SALES OFFICES 


CALIFORNIA 

10600 N. DeAnza Blvd., # 225 
Cupertino, CA 95014 
(408) 996-1600 

840 Newport Center Dr., # 460 
Newport Beach, CA 92660-6323 
(714) 720-9688 

GEORGIA 

3500 Parkway Lane, #210 
Norcross, GA 30092 
(404) 449-8539 

ILLINOIS 

One Pierce Place, #910 
Itasca, IL 60143-2681 
(312) 250-8580 

MASSACHUSETTS 

75 Wells Ave., # 5 

Newton Center, MA 02159-3251 

(617) 964-7080 


MINNESOTA 

3460 Washington Dr., #209 
Eagan, MN 55122-1303 
(612) 454-0323 

NEW YORK 

601 Veterans Memorial Highway, #P 
Hauppauge, NY 11788-1054 
(516) 361-6565 

OREGON 

5285 S.W. MEADOWS RD., #222 
LAKE OSWEGO, OR 97035-9998 
(503) 684-4545 

TEXAS 

14785 Preston Rd., #670 
Dallas, TX 25240 
(214) 233-9394 


Ordering Information 


Part Number 

Clock Speed 

MB86901-APR-G 

20 MHz 

MB86901-APR-G25 

25 MHz 
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Advanced Products Division 


For further information outside the U.S., please contact: 

FUJITSU LIMITED 


Semiconductor Marketing 

Furukawa Sogo Bldg, 6-1 Marunouchi, 2-chome 

Chiyoda-ku, Tokyo 100, Japan 

TEL 011-81-3-216-3211 • FAX 011-81-3-216-9771 

FUJITSU MIKROELEKTRONIK GmbH 


Arabella Center 9.0G./A 

Lyoner Strasse 44-48, D-6000, Frankfurt am Main 71, F.R. Germany 
TEL 011-49-69-66320 • TELEX 0411 963 

FUJITSU MICROELECTRONICS PACIFIC ASIA LIMITED 


805 Tsimshatsui Centre. 66 Mody Road 

Tsimshatsui East, Kowloon, Hong Kong 

TEL 3-732-0100 • FAX 3-722-7984 • TELEX 31959 FUJIS HK 



FUJITSU MICROELECTRONICS. INC. 

Advanced Products Division 

50 Rio Robles. San Jose, CA 95134-1804 • TEL (408) 922-9000 • FAX (408) 432-9070 

*SPARC is a trademark of Sun Microsystems, inc. 
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